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(57) Abstract 

Antibody Directed Enzyme Prodrug Therapy (ADEPT) systems for use in cancer based on mutated carboxypeptidase B (CPB) 
enzymes. Enzyme conjugates for ADEPT are substantially non-immunogenic in humans comprising a targeting moiety (for example an 
antibody) capable of binding with a tumour associated antigen, the targeting moiety being linked to a mutated form of a CPB enzynie 
capable of converting a prodrug into an antineoplastic drug wherein die prodrug is not significanUy convertible into antineoplastic drug in 
humans by natural unmutated enzyme. A preferred enzyme mutant is human pancreatic CPB comprising a Lys or Arg residue at position 
253. Suitable mustard prodrugs are disclosed in the specification. 
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CHEMICAL COMPOUNDS 

The invention relates to mutant CPB enzymes for use with 
prodrugs in ADEPT systems* 

Abbreviations 



Ac acetyl 

ADEPT antibody directed enzyme prodrug therapy 

BOC tert-butoxycarbonyl 

CPB carboxypeptidase B 

DCCI 1 , 3-dicyclohexylcarbodiimide 

DMAP 4-dimethylaminopyridine 

DMF N,N-dimethyl-formamide 

DMSO diroethylsulf oxide 

Et ethyl 

EDCI l-(3-dimethylaminopropyl)-3-ethyl-carbodiimide 

HCPB human CPB 

HOBT 1-hydroxybenzotriazole 

PCR polymerase chain reaction 

TFA trifluoroacetic acid 

THF tetrahydrofuran 



Targeting of drugs selectively to kill cancer cells in a 
patient has long been a problem for medical research. ADEPT is one 
approach to overcome the problem. ADEPT uses a tumour selective agent 
such as an antibody conjugated to an enzyme. The conjugate is 
administered to the patient (usually intravenously), allowed to 
localise at the tumour site(s) and clear from the general circulation. 
Subsequently a prodrug is administered to the patient which is 
converted by the enzyme (localised at the tumour sites) into a 
cytotoxic drug which kills tumour cells. Since one molecule of enzyme 
can catalyse generation of many cytotoxic drug molecules an 
amplification effect is produced. Furthermore tumour cells not 
displaying the antigen recognised by the antibody (tumours usually 
display microheterogeneity) are also killed by enzymically amplified 
generation of the cytotoxic drug. A known system uses the procaryotic 
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enzyme carboxypeptidase G2 (CPG2) as the enzyme component (see VO 
88/07378). 

A further problem with known systems is that repeated 
administration of the conjugate results in a host immune response 
rendering the therapy less effective. The antibody component is 
generally a mouse monclonal antibody which can be humanised using 
known techniques to reduce immunogenic ity. However reduction of the 
immunogenicity of the enzyme component has proved more problematic. 
This is because the enzyme component must not be present naturally in 
the human host circulation otherwise premature conversion of prodrug 
to cytotoxic drug will occur and little selective toxicity to tumours 
will be observed. 

These problems have been addressed in part by International 
Patent application VO 95/13095 (Wellcome Foundation). This 
application proposed ADEPT using mutant mammalian enzjmies to activate 
prodrugs which are not activated by the corresponding native enz3nne. 
However only ADEPT systems using mutants of carboxypeptidase A were 
enabled in the disclosure. 

According to one aspect of the present invention there is 
provided a conjugate which is substantially non- immunogenic in humans 
comprising a targeting moiety capable of binding with a tumour 
associated antigen, the targeting moiety being linked to a mutated 
form of a carboxypeptidase B (CPB) enzjnne capable of converting a 
prodrug into an antineoplastic drug wherein the prodrug is not 
significantly convertible into antineoplastic drug in humans by 
natural unmutated enzyme. 

Preferably the targeting moiety is an antibody. 

Preferably the antibody is a F(ab')2 antibody fragment. 

Preferably the enzyme is mutated to comprise a polarity 
change in its active site such that it can turn over a prodrug with a 
complementary polarity. 

Preferably the enzyme is any one of the following pancreatic 
human CPB mutants: 

pancreatic human CPB having amino acid Asp 253 substituted by any one 
of Argy Asn, Gin or Lys optionally in combination with any one or more 
amino acid substitutions selected from: 
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natural amino acid Gin 54 substituted by any one of Arg, Lys or Asn; 
natural amino acid Asp 145 substituted by any one of Val, Leu, He or 
Ala; 

natural amino acid He 201 substituted by any one of Ser or Thr; 
natural amino acid Ser 205 substituted by any one of Asn, Gin, His, 
Lys or Arg; 

natural amino acid He 245 substituted by any one of Ser, Thr, Ala, 
Val, Leu, Asn, Gin, Lys, Arg or His; 

natural amino acid Ala 248 substituted by any one of Asn, Gin, Lys, 
Arg, His, Ser or Thr; 

natural amino acid Gly 251 substituted by any one of Thr, Asn, Ser, 
Gin, His, Lys, Arg, Val, He, Leu, Met, Phe, Ala or Norleucine; and 
natural amino acid Cys 288 substituted by any one of Ser, Thr, Ala, 
Val, Leu or He. 

More preferably the enzyme is any one of the following 
pancreatic human CPB mutants: 

pancreatic human CPB having natural amino acid Asp 253 substituted by 
any one of Arg or Lys and natural amino acid Gly 251 substituted by 
any one of Thr, Asn, Ser, Gin, Lys or Val, optionally in combination 
with any one or more amino acid substitutions selected from: 
natural amino acid Gin 54 substituted by Arg; 
natural amino acid Asp 145 substituted by Ala; 
natural amino acid He 201 substituted by Ser; 
natural amino acid Ser 205 substituted by Asn; 

natural amino acid He 245 substituted by any one of Ser, Ala or His; 
natural amino acid Ala 248 substituted by any one of His, Ser or Asn; 
and 

natural amino acid Cys 288 substituted by any one of Ser or Ala. 

More preferably the enzyme is any one of the following 
pancreatic human CPB mutants: 

pancreatic human CPB having natural amino acid Asp 253 substituted by 
any one of Arg or Lys and natural amino acid Gly 251 substituted by 
any one of Thr, Asn or Ser optionally in combination with any one or 
more amino acid substitutions selected from: 
natural amino acid Gin 54 substituted by Arg; 
natural amino acid Asp 145 substituted by Ala; 
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natural amino acid lie 201 substituted by Ser; 
natural amino acid Ser 205 substituted by Asn; 
natural amino acid lie 245 substituted by Ala; 

natural amino acid Ala 248 substituted by any one of Ser or Asn; and 
natural amino acid Cys 288 substituted by Ser. 

Especially the enzjnne is any one of the following pancreatic 
human CPB mutants: 

D253K; D253R; IG251N, D253K] ; [G251T, D253K] ; [G251S, D253K] ; 
IG251T, D253R1; [A248S,G251T,D253K] ; [A248N,G251N,D253K] ; 
[A248S,G251N,D253KJ; or [S205N,G251N,D253K1 . 

According to another aspect of the present invention there 
is provided a matched two component system designed for use in a host 
in which the components comprise: 

(i) a first component that is a targeting moiety capable of 
binding with a tumour associated antigen , the targeting moiety being 
linked to a CPB enzyme capable of converting a prodrug into an 
antineoplastic drug and; 

(ii) a second component that is a prodrug convertible under the 
influence of the enzjmie to the antineoplastic drug; 

wherein: 

the enzyme is a mutated form of a CPB enzyme; 

the first component is substantially non- immunogenic in the host and; 
the prodrug is not significantly convertible into antineoplastic drug 
in the host by natural unmutated host enz}nne. 

The term "the prodrug is not significantly convertible into 
antineoplastic drug in the host by natural unmutated host enzyme** 
means that the prodrug does not give undue untargeted toxicity 
problems on administration to the host. 

The term "substantially non- immunogenic" means that the 
first component (conjugate) can be administered to the host on more 
than one occasion without causing significant host immune response as 
would be seen with for example the use of a mouse antibody linked to a 
bacterial enzyme in a human host. 

Preferably the mutated enzj^e is based on an enzyme from the 
same species as the host for which the system is intended for use but 
the mutated enzyme may be based on a host enz3nne from a different 
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species as long as the structure of the enzyme is sufficiently 
conserved between species so as not to create undue immunogenicity 
problems . 

Preferably the targeting moiety is an antibody, especially 
an antibody fragment such as for example F(ab')2* Linkage to enzyme 
for conjugate synthesis may be effected by knovn methods such as use 
of heterobifunctional reagents as cross-linkers or by gene fusion or 
any other suitable method. Antibody may be from the same host (eg use 
of mouse antibody in mice) or the antibody may be manipulated such 
that it is not significantly recognised as foreign in the chosen host 
(eg use of chimeric, CDR grafted or veneered mouse antibodies in 
humans). Preferably the first component is a conjugate as defined 
above. 

Transplantation of the variable domains of rodent antibodies 
into the constant domains of human antibodies (Chimeric antibodies) or 
building the antigen binding loops (CDRs) of rodent antibodies into a 
human antibody (CDR grafting) have both been shovn to greatly decrease 
the immunogenicity of the rodent antibody in preclinical studies in 
monkeys and in patients. Even CDR grafted antibodies incorporate a 
large number (>S0) of amino acids from the rodent antibody sequence 
into the human framevork. Despite this in monkeys and patients greatly 
reduced immunogenicity has been reported. This provides evidence that 
mutating a very limited number of amino acids in the catalytic site of 
a host enzyme is likely to result in an enzyme with minimal 
immunogenicity and certainly lower immunogenicity than a non-host 
enzyme. The reader is directed to the following references: A. 
Mountain and J. R. Adair, Biotechnology and Genetic Engineering 
Reviews 10, 1-142, 1992; G. Winter and W. J. Harris, Trends in 
Pharmacological Sciences, 14, 139-143, 1993; I.I. Singer et al, J. 
Immunol, 150, 2844-57, 1993; J. Hakimi et al, J. Immunol, 147 , 
11352-59, 1991 and; J. D. Isacs et al, The Lancet, 340, 748-752, 1992. 
The constant region domains may be for example human IgA, IgE, IgG or 
IgM domains. Human IgG2 and 3 (especially IgG2) are preferred but IgG 
1 and 4 isotypes may also be used. Human antibodies per se may also 
be used such as those generated in mice engineered to produce human 
antibodies. (Fishwald et al. in Nature Biotechnology (1996), 14, 
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The host enzyme is mutated to give a change in mode of 
interaction between enzyme and prodrug in terms of recognition of 
substrate compared vith the native host enzyme. 

Preferably the enzyme mutation is a polarity change in its 
active site such that it turns over a prodrug vith a complementary 
polarity; the prodrug not being significantly turned over by the 
unmutated host enzyme. Preferably the natural host enzyme recognises 
its natural substrate by an ion pair interaction and this interaction 
is reversed in the design of mutated enz3rme and complementary prodrug* 
In this specification the term "active site" includes amino acid 
residues involved in any aspect of substrate recognition and/or 
catalytic functionality. 

Point mutations vill be referred to as follows: natural 
amino acid (using the 1 letter nomenclature) , position, new amino 
acid. For example ''D253K" means that at position 253 of mature active 
HCPB an aspartic acid (D) has been changed to lysine (K). Multiple 
mutations in one enz3nne vill be shown between square brackets with 
individual mutations separated by commas. 

In this specification the term CPB includes the following: 

i) mature, pro and prepro forms of the enzyme with or without "tags" 
(eg c-myc); 

ii) any carboxypeptidase with specificity for peptidic substrates 
having Lys or Arg at the C terminus having substantial sequence 
identity (preferably at least 60% identity, more preferably at least 
70% identity, more preferably at least 80% identity and especially at 
least 90% identity) with mature active pancreatic HCPB within each of 
the key substrate binding sites 187-206 and 238-268; 

preferably human pancreatic and plasma CPB enzymes (the pancreatic 
enzjnne disclosed herein is preferred); 

unless indicated otherwise or self evident from the context. 
Naturally occurring allelic variants of CPBs are also contemplated. 
An allelic variant is an alternate form of sequence which may have a 
substitution, deletion or addition at one or more positions, which 
does not substantially alter the function of the CPB. 
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To determine the degree of identity between a 
carboxypeptidase and mature active pancreatic HCPB at its key 
substrate binding sites the following procedure is followed to align 
the sequences. Uhen amino acid residues 109 to 415 of SEQ ID NO: 39 
are renumbered 1 to 307, and aligned with other carboxypeptidases 
using a Clustal method with PAI1250 residue weighting as described in 
the LASERGENE biocomputing software for MACINTOSH User's Guide, A 
Manual for the LASERGENE system (2nd Edition, 1994, published by 
DNASTAR Inc., 1228 South Park Street, Madison, Wisconsin 53715, USA) 
the key zinc binding residues (at H66, E69 and H194), the key 
terminal-carboxy substrate binding residues (at R124, N141, R142, and 
Y246) and the catalytic residues (at R124, Y246 and E268) are 
essentially aligned. The key substrate recognition residue is deemed 
to be D253, with the substrate recognition pocket lying between the 
core p-sheet (including residues 187 to 206) and the active-site 
surface loop and helix (residues 238 to 268). Residues 263-268 
(within sequence 238-268) are beta strand, although they are part of 
the core beta sheet. 

Mutant CPBs of the invention are mutants of any of the above 
CPBs having the desired property required for the invention. The 
following mutants of pancreatic HCPB are preferred: D253K, D253R and; 
especially [G251N,D253RJ ; corresponding mutations in other CPBs are 
also contemplated. Key mutation positions are also set out in the 
following table. 
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A mutant CFB of the invention may also comprise other 
"conservative" mutations (Insertions, substitutions and/or deletions) 
that do not significantly alter the properties of the key mutation. 
For the purposes of this document a conservative amino acid 
substitution is a substitution whose probability of occurring in 
nature is greater than ten times the probability of that substitution 
occurring by chance (as defined by the computational methods described 
by Dayhoff et al, Atlas of Protein Sequence and Structure, 1971, page 
95-96 and figure 9-10) and as set out in the following table. 



CONSERVATIVE SUBSTITDTIONS 







rreierreu 




Substitutions 


Substitutions 


Ala (A) 


val; leu; ile 


val 


Arg (R) 


lys; gin; asn 


lys 


Asn (N) 


gin; his; lys; arg 


gin 


Asp (D) 


glu 


glu 


Cys (C) 


ser 


ser 


Gin (Q) 


asn 


asn 


Glu (E) 


asp 


asp 


Gly (G) 


pro 


pro 


His (H) 


asn; gin; lys; arg 


arg 


Ile (I) 


leu; val; met; ala; phe; 
norleucine 


leu 


Leu (L) 


norleucine; lie; val; 
met; ala; phe 


ile 


Lys (K) 


arg; gin; asn 


arg 


Met (M) 


leu; phe; ile 


leu 


Phe (F) 


leu; val; lie; ala 

gly 


leu 


Pro (P) 


gly 


Ser (S) 


thr 


thr 


Thr (T) 


ser 


ser 


Trp (W) 


tyr 


tyr 


Tyr (Y) 


trp; phe; thr; ser 


phe 


Val (V) 


ile; leu; met; phe; 
ala; norleucine 


leu 



References on CPBs include the following: Folk JE in The 
Enzymes Vol III, Academic Press (1971), pg 57; Coll M et al (1991) 
EMBO Journal 10, 1-9; Eaton DL et al (1991) J Biol Chem 266, 
21833-21838; Yamamoto K et al (1992) J Biol Chem 267, 2575-2581; US 
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Patent 5364934 (Genentech) and; International Patent Application UO 
95/14096 (Eli Lilly). 

According to another aspect of the present invention there 
is provided a system as hereinbefore defined for use in a method of 
controlling the growth of neoplastic cells in a host in which the 
method comprises administration to said host an effective amount of a 
first component, allowing the first component to clear substantially 
from the general circulation, and administering an effective amount of 
a second component. Preferably the components are administered 
intravenously. 

According to another aspect of the invention there is 
provided a method of treating neoplastic cells in a host in which the 
method comprises administration to said host an effective amount of a 
first component, allowing the first component to clear substantially 
from the general circulation, and administering an effective amount of 
a second component wherein the components form a two component system 
as defined herein. Preferably the components are administered 
intravenously. 

According to another aspect of the present invention there 
is provided a pharmaceutical composition comprising an effective 
tumour localising amount of a first component as hereinbefore defined 
and a pharmaceutically acceptable carrier or diluent. Preferably the 
composition is suitable for intravenous administration. Preferably 
the first component is supplied as a dry solid which is reconstituted 
before use with a suitable diluent. 

According to another aspect of the present invention there 
is provided a pharmaceutical composition comprising an effective 
antitumour amount of a second component as hereinbefore defined and a 
pharmaceutically acceptable carrier or diluent. Preferably the 
composition is suitable for intravenous administration. Preferably 
the second component is supplied as a dry solid which is reconstituted 
before use with a suitable diluent. 

E.coli MSD 1646 containing pCG330 (also known as pICI1698) 
was deposited under the Budapest Treaty on 23rd November 1994 with the 
National Collection of Industrial and Marine Bacteria (NCIHB), 23 St 
Machar Drive, Aberdeen, Scotland, United Kingdom AB2 IRY; the 
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accession number is NCIHB 40694. NCIHB 40694 is another aspect of 
the present invention* 

Antibody A5B7 was deposited as hybridoma deposit reference 
93071411 under the Budapest Treaty on 14th July 1993 at ECACC, PHLS 
Centre for Applied Microbiology & Research, Porton Down, Salisbury, 
Wiltshire SP4 OJG, OK. A humanised antibody A5B7 in the form of a 
F(ab')2 is preferred. 

Antibody 806.077 was deposited as hybridoma deposit 
reference 96022936 under the Budapest Treaty on 29th February 1996 at 
ECACC, PHLS centre for Applied Microbiology & Research, Porton Down, 
Salisbury, Wiltshire SP4 OJG, UK. Antibody 806.077 is an alternative 
anti-CEA antibody to A5B7 which is suitable for use in the present 
invention. 

According to another aspect of the present invention there 
is provided a method of making a first component (conjugate) as 
herein described by linking: 

a targeting moiety capable of binding with a tumour associated antigen 
and; 

an enzyme capable of converting a prodrug into an antineoplastic drug 
wherein the enzjme is a mutated form of a host CPB enzyme. Linking 
may be effected by chemical or molecular biological techniques. 

According to another aspect of the present invention there 
is provided a first component of the present invention. 

According to another aspect of the present invention there 
is provided a polynucleotide sequence capable of encoding a first 
component (conjugate) of the present invention. 

According to another aspect of the present invention there 
is provided a vector comprising a polynucleotide sequence capable of 
encoding a first component of the present invention. 

According to another aspect of the present invention there 
is provided a cell comprising a vector or a poljmucleotide sequence 
capable of encoding a first component of the present invention. 

According to another aspect of the present invention there 
is provided a mutant CPB enzyme having the desired properties of the 
invention. 
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According to another aspect of the present invention there 
is provided a polynucleotide sequence capable of encoding a mutant CPB 
enzyme of the present invention* The present invention further 
relates to polynucleotides which hybridize to the polynucleotides 
encoding mutant CPBs if there is at least 70% identity between the 
sequences. The present invention particularly relates to 
polynucleotides which hybridize under stringent conditions to the 
hereinabove-described poljmucleotides* As herein used, the term 
"stringent conditions" means hybridization will occur only if there is 
at least 95Z and preferably at least 97% identity between the 
sequences. 

According to another aspect of the present invention there 
is provided a vector comprising a pol3mucleotide sequence capable of 
encoding a mutant CPB enzyme of the present invention. The 
polynucleotide sequence may be included in any one of a variety of 
expression vehicles, in particular vectors or plasmids for expressing 
a polypeptide. Such vectors Include chromosomal i nonchromosomal and 
sjmthetic DNA sequences, e.g., derivatives of SV40; bacterial 
plasmids; phage DNA; yeast plasmids; vectors derived from combinations 
of plasmids and phage DNA, viral DNA such as vaccinia, adenovirus, 
fowl pox virus, and pseudorabies. However, any other plasmid or vector 
may be used as long they are replicable and viable in the host. The 
appropriate DNA sequence may be inserted into the vector by a variety 
of procedures. 

In general, the DNA sequence is inserted into an appropriate 
restriction endonuclease site(s) by procedures known in the art. Such 
procedures and others are deemed to be within the scope of those 
skilled in the art. The DNA sequence in the expression vector is 
operatively linked to an appropriate expression control sequence(s) 
(promoter) to direct mRNA synthesis. As representative examples of 
such promoters, there may be mentioned: LTR or SV40 promoter, the 
E. coli. lac or trp, the phage lambda P.sub.L promoter and other 
promoters known to control expression of genes in prokaryotic or 
eukaryotic cells or their viruses. The expression vector also contains 
a ribosome binding site for translation initiation and a transcription 
terminator. The vector may also include appropriate sequences for 



SUBSTITUTE SHEET (RULE 26) 



WO^/07769 



- 14 - 



PCT/GB96/01975 



amplifying expression. In addition, the expression vectors preferably 
contain a gene to provide a phenotypic trait for selection of 
transformed host cells such as dihydrofolate reductase or neomycin 
resistance for eukaryotic cell culture, or such as tetracycline or 
ampicillin resistance in E. coli. The vector containing the 
appropriate DNA sequence as hereinabove described, as veil as an 
appropriate promoter or control sequence, may be employed to transform 
an appropriate host to permit the host to express the protein. As 
representative examples of appropriate hosts, there may be mentioned: 
bacterial cells, such as E. coli, Streptorayces , Salmonella 
lyphimurium; fungal cells, such as yeast; insect cells such as 
Drosophila and Sf9; animal cells such as NSO,CHO, COS or Bowes 
melanoma; plant cells, etc. The selection of an appropriate host is 
deemed to be within the scope of those skilled in the art from the 
teachings herein. More particularly, the present invention also 
includes recombinant constructs comprising one or more of the 
sequences as broadly described above. The constructs comprise a 
vector, such as a plasmid or viral vector, into which a sequence of 
the invention has been inserted, in a forward or reverse orientation. 
In a preferred aspect of this embodiment, the construct further 
comprises regulatory sequences, including, for example, a promoter, 
operably linked to the sequence. 

Large numbers of suitable vectors and promoters are known to 
those of skill in the art, and are commercially available. The 
following vectors are provided by way of example. Bacterial: pQE70, 
PQE60, pQE-9 (Qiagen), pbs, pDlO, phagescript, psiX174, pbluescript 
SK, pbsks, pNH8A,pNH16a, pNHlSA, pNH46A (Stratagene) ; ptrc99a, 
pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia). Eukaryotic: pWLNEO, 
pSV2CAT, pOG44, pXTl, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL 
(Pharmacia). However, any other plasmid or vector may be used as long 
as they are replicable and viable in the host. Promoter regions can 
be selected from any desired gene using CAT (chloramphenicol 
transferase) vectors or other vectors with selectable markers. Two 
appropriate vectors are PKK232-8 and PCM7. Particular named bacterial 
promoters include lad, lacZ, T3, T7, gpt, lambda P.sub.R, P. sub. L and 
trp. Eukaryotic promoters include CMV immediate early, HSV thymidine 
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kinase, early and late SV40, LTRs from retrovirus, and mouse 
metallothioneln-I. Selection of the appropriate vector and promoter is 
well within the level of ordinary skill in the art. 

In a further embodiment, the present invention relates to 
host cells containing the above-described construct. The host cell can 
be a higher eukaryotic cell, such as a mammalian cell, or a lower 
eukaryotic cell, such as a yeast cell, or the host cell can be a 
prokaryotic cell, such as a bacterial cell. Introduction of the 
construct into the host cell can be effected by for example calcium 
phosphate transfection, DEAE-Dextran mediated transfection, 
lipofection (cationic lipid-mediated delivery of polynucleotides 
[Feigner et al. in Methods: A Companion to Methods in Enzymology 
(1993) 5, 67-75] or electroporation (Davis, L. , Dibner, M., Battey, 
I., Basic Methods in Molecular Biology, (1986)). The skilled reader 
will be able to select the most appropriate method for a given host. 
The constructs in host cells can be used in a conventional manner to 
produce the gene product encoded by the recombinant sequence. 
Alternatively, the polypeptides of the invention can be synthetically 
produced by conventional peptide synthesizers. Mature proteins can 
be expressed in mammalian cells, yeast, bacteria, or other cells under 
the control of appropriate promoters. Cell-free translation systems 
can also be employed to produce such proteins using RNAs derived from 
the DNA constructs of the present Invention. Appropriate cloning and 
expression vectors for use with prokaryotic and eukaryotic hosts are 
described by Sambrook, et al., Molecular Cloning: A Laboratory Manual, 
Second Edition, Cold Spring Harbor, N.Y., (1989), the disclosure of 
which is hereby incorporated by reference. 

Transcription of the DNA encoding the polypeptides of the 
present invention by higher eukaryotes is increased by inserting an 
enhancer sequence into the vector. Enhancers are cis-acting elements 
of DNA, usually about from 10 to 300 bp that act on a promoter to 
increase its transcription. Examples including the SV40 enhancer on 
the late side of the replication origin bp 100 to 270, a 
cytomegalovirus early promoter enhancer, the polyoma enhancer on the 
late side of the replication origin, and adenovirus enhancers. 
Generally, recombinant expression vectors will include origins of 
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replication and selectable markers permitting transformation of the 
host cell, e.g. I the ampicillin resistance gene of E. Coli and S. 
cerevisiae TRFl gene, and a promoter derived from a highly-expressed 
gene to direct transcription of a downstream structural sequence. 
Such promoters can be derived from operons encoding glycolytic enzymes 
such as 3-phosphoglycerate kinase (FGK), alpha-factor, acid 
phosphatase, or heat shock proteins, among others. The heterologous 
structural sequence is assembled in appropriate phase with translation 
initiation and termination sequences, and preferably, a leader 
sequence capable of directing secretion of translated protein into the 
periplasmic space or extracellular medium. Optionally, the 
heterologous sequence can encode a fusion protein including an 
N- terminal identification peptide imparting desired characteristics, 
e.g., stabilization or simplified purification of expressed 
recombinant product. Useful expression vectors for bacterial use are 
constructed by inserting a structural DNA sequence encoding a desired 
protein together with suitable translation initiation and termination 
signals in operable reading phase vith a functional promoter. The 
vector will comprise one or more phenotypic selectable markers and an 
origin of replication to ensure maintenance of the vector and to, if 
desirable, provide amplification vithin the host. Suitable 
prokaryotic hosts for transformation include E. coli. Bacillus 
subtilis. Salmonella typhimurium and various species vithin the genera 
Pseudomonas, Streptomyces , and Staphylococcus, although others may 
also be employed as a matter of choice. As a representative but 
nonlimiting example, useful expression vectors for bacterial use can 
comprise a selectable marker and bacterial origin of replication 
derived from commercially available plasmids comprising genetic 
elements of the well known cloning vector pBR322 (ATCC 37017), pAT153 
and pBluescript. Such commercial vectors include, for example, 
pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEHl (Promega 
Biotec, Hadison, VI, USA). These pBR322 "backbone" sections are 
combined with an appropriate promoter and the structural sequence to 
be expressed. Following transformation of a suitable host strain and 
growth of the host strain to an appropriate cell density, the selected 
promoter is induced by appropriate means (e.g., temperature shift or 
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chemical induction) and cells are cultured for an additional period. 

Cells are tjrpically harvested by centrifugation^ disrupted 
by physical or chemical means, and the resulting crude extract 
retained for further purification. Microbial cells employed in 
expression of proteins can be disrupted by any convenient method, 
including freeze- thaw cycling, sonication, mechanical disruption, or 
use of cell lysing agents, such methods are veil known to those 
skilled in the art. Various mammalian cell culture systems can also 
be employed to express recombinant protein. Examples of mammalian 
expression systems include the COS-7 lines of monkey kidney 
fibroblasts, described by Gluzman, Cell, 23:175 (1981), and other cell 
lines capable of expressing a compatible vector, for example, the NSO, 
C127, 3T3, CHO, HeLa and BHK cell lines. Mammalian expression vectors 
will comprise an origin of replication, a suitable promoter and 
enhancer, and also any necessary ribosome binding sites, 
polyadenylation site, splice donor and acceptor sites, transcriptional 
termination sequences, and 5 'flanking nontranscribed sequences. DNA 
sequences derived from the SV40 splice, and polyadenylation sites 
may be used to provide the required nontranscribed genetic elements. 
Expression products are recovered and purified from recombinant cell 
cultures by methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange 
chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography hydroxylapatite 
chromatography and lectin chromatography. It is preferred to have low 
concentrations (approximately 0.15-5 mM) of calcium ion present during 
purification. (Price et al., J. Biol. Chem., 244:917 (1969). Protein 
refolding steps can be used, as necessary, in completing configuration 
of the mature protein. Finally, high performance liquid chromatography 
(HPLC) can be employed for final purification steps. The 
polypeptides of the present invention may be a naturally purified 
product, or a product of chemical synthetic procedures, or produced by 
recombinant techniques from a prokaryotic or eukaryotic host (for 
example, by bacterial, yeast, higher plant, insect and mammalian cells 
in culture). Depending upon the host employed in a recombinant 
production procedure, the polypeptides of the present invention may be 
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glycosylated with mammalian or other eukaryotic carbohydrates or may 
be nonglycosylated. Poljrpeptides of the invention may also include an 
initial methionine amino acid residue. 

Other systems of expression are also contemplated such as 
for example transgenic non-human mammals in which the gene of 
interest, preferably cut out from a vector and preferably in 
association with a mammary promoter to direct expressed protein into 
the animal's milk, is introduced into the pronucleus of a mammalian 
zygote (usually by microinjection into one of the two nuclei (usually 
the male nucleus) in the pronucleus) and thereafter implanted into a 
foster mother. A proportion of the animals produced by the foster 
mother will carry and express the introduced gene which has integrated 
into a chromosome. Usually the integrated gene is passed on to 
offspring by conventional breeding thus allowing ready expansion of 
stock. Preferably the protein of interest is simply harvested from 
the milk of female transgenic animals. The reader is directed to the 
following publications: Simons et al. (1988), Bio/Technology 
6:179-183; Wright et al. (1991) Bio/Technology 9:830-834; OS 4,873,191 
and; US 5,322,775. Manipulation of mouse embryos is described in 
Hogan et al , "Manipulating the Mouse Embryo; A Laboratory Manual**, 
Cold Spring Harbor Laboratory 1986. 

Transgenic plant technology is also contemplated such as for 
example described in the following publications: Swain W.F. (1991) 
TIBTE(3I 9: 107-109; Ma J.K.C. et al (1994) Eur. J. Immunology 24: 
131-138; Hiatt A. et al (1992) FEBS Letters 307:71-75; Hein M.B. et al 
(1991) Biotechnology Progress 7: 455-461; Duering K. (1990) Plant 
Molecular Biology 15: 281-294. 

If desired, host genes can be inactivated or modified using 
standard procedures as outlined briefly below and as described for 
example in "Gene Targeting; A Practical Approach", IRL Press 1993. 
The target gene or portion of it is preferably cloned into a vector 
with a selection marker (such as Neo) inserted into the gene to 
disrupt its function. The vector is linearised then transformed 
(usually by electroporation) into embryonic stem (ES) cells (eg 
derived from a 129/Ola strain of mouse) and thereafter homologous 
recombination events take place in a proportion of the stem cells. 
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The stem cells containing the gene disruption are expanded and 
injected into a blastocyst (such as for example from a C57BL/6J mouse) 
and implanted into a foster mother for development. Chimeric 
offspring can be identified by coat colour markers. Chimeras are bred 
to ascertain the contribution of the ES cells to the germ line by 
mating to mice with genetic markers which allow a distinction to be 
made between ES derived and host blastocyst derived gametes. Half of 
the ES cell derived gametes will carry the gene modification. 
Offspring are screened (eg by Southern blotting) to identify those 
with a gene disruption (about 50% of progeny). These selected 
offspring will be heterozygous and therefore can be bred with another 
heterozygote and homozygous offspring selected thereafter (about 25?: 
of progeny). Transgenic animals with a gene knockout can be crossed 
with transgenic animals produced by known techniques such as 
microinjection of DNA into pronuclei, sphaeroplast fusion (Jakobovits 
et al . (1993) Nature 362:255-258) or lipid mediated transfection (Lamb 
et al . (1993) Nature Genetics 5 22-29) of ES cells to yield transgenic 
animals with an endogenous gene knockout and foreign gene replacement. 

ES cells containing a targeted gene disruption can be 
further modified by transforming with the target gene sequence 
containing a specific alteration, which is preferably cloned into a 
vector and linearised prior to transformation. Following homologous 
recombination the altered gene is introduced into the genome. These 
embryonic stem cells can subsequently be used to create transgenics as 
described above. 

The term "host cell" includes any procaryotic or eucaryotic 
cell suitable for expression technology such as for example bacteria, 
yeasts, plant cells and non-human mammalian zygotes, oocytes, 
blastocysts, embryonic stem cells and any other suitable cells for 
transgenic technology. If the context so permits the terra "host cell" 
also includes a transgenic plant or non-human mammal developed from 
transformed non-human mammalian zygotes, oocytes, blastocysts, 
embryonic stem cells, plant cells and any other suitable cells for 
transgenic technology. 

According to another aspect of the present invention there 
is provided a cell comprising a vector or a pol3mucleotide sequence 
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capable of encoding a mutant CPB enzyme of the present invention* 
According to another aspect of the present invention there is provided 
a nucleotide sequence encoding a mature human pancreatic 
carboxjrpeptidase B defined in SEQ ID NO: 39 from position 109 onvards 
or a mutant thereof in vhich there is a cysteine residue encoded at 
position 243. This cysteine residue at position 243 in the cloned 
sequence is not observed in other published human pancreatic 
carboxypeptidase B sequences, as highlighted by Yamamoto et al, in the 
Journal of Biological Chemistry, v267, 2575-2581, 1992, where she 
shows a gap in her sequence following the position numbered 244, when 
aligned with other mammalian pancreatic carboxsrpeptidase B amino acid 
sequences (see discussion in Reference Example 6). Preferably the 
nucleotide sequence is in isolated form, that is to say at least 
partially purified from any naturally occurring form. Preferably the 
mutants are mutant CPB enzymes suitable for the present invention. 

According to another aspect of the present invention there 
is provided a method of making human pancreatic carboxypeptidase B or 
a mutant thereof in which there is a cysteine residue encoded at 
position 243 comprising expression in a host cell of a nucleotide 
sequence encoding a mature human pancreatic carboxypeptidase B 
defined in SEQ ID NO: 39 from position 109 onwards or a mutant thereof 
in which there is a cysteine residue encoded at position 243. 

According to another aspect of the present invention there 

is provided prodrugs of Formula 1 wherein: 

V represents a direct bond or CH^ 
12 

R and R independently represent CI, Br, I or -OSO^Me 
3 4 

R and R independently represent H, C^ ^alkyl, Cj^ ^alkoxy, F or CI 

C, ,alkyl or 
5* 6 

R and R independently represent H or C, ,alkyl or 
3 6 

R and R together can represent -CH»:CH-CH-CH- to form a bicyclic ring 
system optionally containing 1-3 heteroatoms selected from 0, N and S 

X is selected from : 

7 8 7 8 

-CHR CHR - where R and R are selected from H and C. ^alkyl 

7 8 

optionally substituted with phenyl provided at least R or R is H; 



SUBSTITUTE SHEET (RULE 26) 



wo 97/07769 PCT/GB96/01975 

- 21 - 

-NHCHR^- where is selected from H; 

the side chain of common amino acids including for example 
the side chain of Ala, Arg, Asn, Asp, Cys, Glu, Gin, His, 
He, Leu, Lys, Met, Phe, Ser, Thr, Trp, Tyr & Val; 
(CH2)j^C0NHR^^ where n«l-3 and R^^ is selected from 
Cj^galkyl, cyclopentyl, cyclohexyl and phenyl and each R^^ 

listed hereinbefore is optionally substituted with halogen, Cj^^^alkyl 

or Cj_^alkoxy; 

-NH-N{R*^)- where R^^ is selected from H and C, ,alkyl; 



Y represents NH or 0 
Z is selected from 

.(CH2)„-C02H (n = 1- 4) 

-CH2OCH2CO2H 

-CH2-CH=CH-C02H 

-(CH2)jjtetrazol-5yl (n « 1- A) 

•(CH2)j^C0NHS02R^^ (n = 1- A) in which R^^ is selected from 

Ci.A^lkyl 

-(CH2)j^S02NH2 (n - 1- 4) 
and salts thereof. 

According to another aspect of the present invention there 
is provided any one of the following compounds or a pharmaceutically 
acceptable salt thereof: 

a) N- ( ^- { ^- [ bis- ( 2-chloroethyl ) -amino ] -3-methyl-phenoxy } - 
benzoyl ) -L-alanine ; 

b) N- [N- (4- {4- [bis- (2-chloroethyl) -amino] -3-methyl-phenoxy) - 
benzoyl ) -L-alanine ]-L-glutamic acid; 

c) N- (4- {4- [bis- (2-chloroethyl ) -amino ]-phenoxy) -benzoyl )-L- 
alanine; or 

d) N- [N- (4- [4- [bis- (2-chloroethyl) -amino] -phenoxy) -benzoyl ) -l-- 
alanine] -L-glutamic acid. Compounds b) and d) are preferred prodrug 
second components of the invention. Compounds a) and c) are the 
corresponding drugs. 

According to another aspect of the present invention there 
is provided a compound of Formula 1 or prodrugs b) or d) described 
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above or a phannaceutically acceptable salt thereof for use as a 
medicament. 

According to another aspect of the present invention there 
is provided the compoxind of Formula 1 or prodrugs b) or d) described 
above or a pharmaceutically acceptable salt thereof for preparation of 
a a medicament for treatment of cancer (in combination with a first 
component of the invention). 

In this specification the generic term "alkyl" includes both 
straight-chain and branched-chain alkyl groups. However references to 
individual alkyl groups such as "propyl" are specific for the 
straight-chain version only and references to individual 
branched-chain alkyl groups such as "isopropyl" are specific for the 
branched-chain version only. An analogous convention applies to other 
generic terms. 

It is to be understood that, insofar as certain of the 
compounds of Formula 1 may exist in optically active or racemic forms 
by virtue of one or more asymmetric carbon atoms, the invention 
includes in its definition any such optically active or racemic form 
which possesses the property of being a substrate for mutant CPBs of 
the invention. However in compounds of Formula 1, at the carbon atom 
having groups Y, Z and COOH attached, if there is a corresponding free 
amino acid then the carbon atom preferably has an L configuration in 
the corresponding free amino acid. 

The synthesis of optically active forms may be carried out 
by standard techniques of organic chemistry well known in the art, for 
example by ssmthesis from optically active starting materials or by 
resolution of a racemic form. Similarly, substrate properties against 
mutant CPBs may be evaluated using the standard laboratory techniques. 

A suitable pharmaceutically-acceptable salt of a basic 
compound of Formula 1 is, for example, an acid-addition salt with, for 
example, an inorganic or organic acid, for example hydrochloric, 
hydrobromic, sulphuric, phosphoric, trifluoroacetic, citric or maleic 
acid. In addition a suitable pharmaceutically-acceptable salt of an 
acidic compound of Formula 1 is an alkali metal salt, for example a 
sodium or potassium salt, an alkaline earth metal salt, for example a 
calcium or magnesium salt, an ammonium salt or a salt with an organic 
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base which affords a physiologically-acceptable cation, for example a 
salt with methylamine, dimethylamine, trimethylamine, piper idine, 
morpholine or tris-(2-hydroxyethyl)amine- 

The compounds of this invention may be utilized in 
compositions such as tablets, capsules or elixirs for oral 
administration, suppositories for rectal administration, sterile 
solutions or suspensions for parenteral or intramuscular 
administration, and the like. The compounds of this invention can be 
adminstered to patients (animals and human) in need of such treatment 
in dosages that will provide optimal pharmaceutical efficacy. 
Although the dose will vary from patient to patient depending upon the 
nature and severity of disease, the patient's weight, special diets 
then being followed by a patient, concurrent medication, and other 
factors which those skilled in the art will recognize, the dosage 
range will generally be about 1 to lOOOmg. per patient per day which 
can be administered in single or multiple doses • Preferably, the 
dosage range will be about 2.5 to 250mg. per patient per day; more 
preferably about 2.5 to 75mg. per patient per day. 

Naturally, these dose ranges can be adjusted on a unit basis 
as necessary to permit divided daily dosage and, as noted above, the 
dose will vary depending on the nature and severity of the disease, 
weight of patient, special diets and other factors. 

Typically, these combinations can be formulated into 
pharmaceutical compositions and discussed below. 

About 1 to lOOmg. of compound or mixture of compounds of 
Formula 1 or a physiologically acceptable salt thereof is compounded 
with a physiologically acceptable vehicle, carrier, excipient, binder, 
preservative, stabilizer, flavor, etc., in a unit dosage form as 
called for by accepted pharmaceutically practice. The amount of 
active substance in these compositions or preparations is such that a 
suitable dosage in the range indicated is obtained. 

Illustrative of the adjuvants which can be incorporated in 
tablets, capsules and the like are the following: a binder such as gum 
tragacanth, acacia, corn starch or gelatin; an excipient such as 
microcrystalline cellulose; a disintegrating agent such as corn 
starch, pregelatinized starch, alginic acid and the like; a lubricant 
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such as magnesium stearate; a sweetening agent such as sucrose , 
lactose or saccharin; a flavoring agent such as peppermint, oil of 
vintergreen or cherry. Vhen the dosage unit form is a capsule , it may 
contain, in addition to materials of the above type, a liquid carrier 
such as fatty oil. Various other materials may be present as coatings 
or to otherwise modify the physical form of the dosage unit. For 
instance, tablets may be coated with shellac, sugar or both. A syrup 
or elixir may contain the active compound, sucrose as a sweetening 
agent, methyl and propyl pariabens as preservatives, a dye and a 
flavoring such as cherry or orange flavor. 

Sterile compositions for injection can be formulated 
according to conventional pharmaceutical practice by dissolving or 
suspending the active substance in a vehicle such as water for 
injection, a naturally occuring vegetable oil like sesame oil, coconut 
oil, peanut oil, cottonseed oil, etc., or a synthetic fatty vehicle 
like ethyl oleate or the like. Buffers, preservatives, antioxidants 
and the like can be incorporated as required. 

A compound of the invention of Formula I, or a 
pharmaceutically-acceptable salt thereof, may be prepared by any 
process known to be applicable to the preparation of 
structurally-related compounds. Such procedures are provided as a 
further feature of the invention and are illustrated by the following 
representative examples in which variable groups have any of the 
meanings defined hereinbefore unless otherwise indicated. Vhere a 
sjmthesis of a particular compound is expanded upon below it will be 
appreciated that the general methodology can be applied to cover all 
compounds of the particular structure under discussion. 

1. Compounds with W = direct bond 

Such compounds may be prepared as outlined in Figure 9. 

These prodrugs are cleaved by mutant CPB to liberate an intermediate 

which further collapses to release the corresponding phenol mustard 
(tSTpical ICcQ = l.SyM). 
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Suitable reagents for steps a-d include: 

(a) DCCI, HOBT or water soluble carbodiimide (EDCI) or isobutyl 
chlorof ormate/triethylamine ; 

(b) TFA (if Pj « t-butyl, P2 is benzyl) or H2,Pd/C (if Pj is benzyl 
and Pj- t-butyl); 

(c) EDCI,DMAP,CHCl3, 

(d) Hj/Pd/C If Pj^benzyl or TFA if t-butyl is used in the protection. 
Compound 2 in Figure 9 

i) yhen Y « NHj and a protecting group such as benzyl, 
and when Z is for example -(CH2)jj-C02H (n = 1-4) then when n » 1, 
dibenzyl L-aspartic acid is used; when n = 2, L-glutamic acid dibenzyl 
ester is used and; when n ^ 3, L-2-amino adipic acid dibenzyl ester is 
used. 

ii) When Z is -(CH2)^-tetrazole: in the case of for example n « 
2, the sequence of reactions illustrated in Figure 10 is used to 
generate the required dibenzyl protected intermediate from the known 
methyl ester. Suitable reagents for steps a-e include: 

(a) CS2CO3, PhCHjBr, DMF; 

(b) lOX Pd/C, H2,B0C-0-B0C ; 

(c) NaOH, MeOH, H2O; 

(d) CS2CO2, PhCH2Br, DMF ; isomers separated; 

(e) HCl, ether, CH2CI2 

iii) When Z is -(CH2)jjC0NHS02R^^ in the case of for example n . 2 
and R^^ = Me, the protected intermediate is made from N-BOC-a- benzyl 
glutamic acid as illustrated in Figure 11. Suitable reagents for 
steps a-b include: 

(a) MeS02NH2 DCCI, DMAP; 

(b) HCl, EtOAc. 
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iv) Vhen Z » -(CH2)qS02NH2 in the case of for example n«:2, then 

L-2-dinino-4-sulfdinoylbutyric acid-benzyl ester, produced from 
L-2-amino-4-sulfamoylbutyric acid (Aldrich Chemical Company) , is used. 



V) Compounds where Y is OH are generated by established routes 

or by for example using compounds such as L-malic acid instead of the 
corresponding L-glutamic acid* 

Compound 3 in Figure 9 

i) When X = -CH2CH2- the intermediate can be made by reacting 
a compound illustrated as compound 2 in Figure 9 with succinic 
anhydride to generate the half succinate ester where 

Pj^ » H. Alternatively half esters of succinic acid can be used to 
couple to the above intermediate instead of using succinic anhydride. 

9 

ii) When X= -NHCH(R )- the prodrug is cleaved by mutant CPB to 
generate a compound of Formula 5 which is directly cytotoxic. For 
example when =(CH2)2CONH-nC^Hg and R^=R^»C1, R^«R*«R^=R^=H the 
cytotoxicity versus LoVo cells is about ICjq- 20mM. 

To make compounds where X s NHCH(R^) conventional peptide 
coupling methodology is used as illustrated in Figure 12. The 
intermediate is then treated with acid (eg HCl/ether) to form the free 
amine. Coupling to the phenol mustard is carried out as illustrated 
in Figure 13. Suitable reagents for step a include: 

1. para-nitrophenylchloroformate, triethylamine, chloroform 

2. triethylamine, CH2CI2 or; 

1. COClj/ Quinoline, CH2CI2 

2. triethylamine, CH2CI2 

Compounds where W = CH2 and X = -NH-NH(R^^)- 

Such compounds may be synthesised as illustrated in Figure 14. 

Suitable reagents for steps a-b include: 

(a) B0C-N(R^^)-NH2, EDAC, CH2CI2 ; TFA, HCl/ETOAc 
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(b) 1. pyridine, CH2CI2 

2. triethylamine, CH2CI2 

The resulting product is then deprotected by standard 

methods • 

Uhen a pharmaceutically-acceptable salt of a compound of the 
formula I is required, it may be obtained, for example, by reaction of 
said compound vith a suitable acid or base using a conventional 
procedure. When an optically active form of a compound of the formula 
I is required, it may be obtained by carrying out one of the aforesaid 
procedures using an optically active starting material, or by 
resolution of a racemic form of said compound using a conventional 
procedure. 

Further uses of mutant CPBs of the invention include the 

following. 

i) Carboxypeptidase enzymes may be used for the sequential 
removal of C-terminal amino acids from proteins and, following amino 
acid analysis of the residues released, can be used for determining 
the C-terminal amino acid sequence of proteins (R. P. Ambler, in : 
Methods in Enzymology, 1967, vol. XI, 436-AA5, Academic Press). The 
use of a mutant CPB possessing specificity for C-terminal aspartate 
and glutamate residues allows the use of these enzjmies in extending 
the scope and ease of C-terminal analysis by carboxypeptidase 
digestion. 

ii) Mutant enzymes may be used as enzyme labels in immunoassays. 
Product from substrate (prodrug) turnover may be detected by any 
suitable technique eg HPLC. Immunoassay techniques using enzymes as 
labels are described in A Practical Guide to ELISA by D.M. Kemeny, 
Pergamon Press 1991. 

The invention will now be described by the following 
non-limiting Examples (with reference to the Reference Examples) in 
which: 

(i) evaporations were carried out by rotary evaporation in 
vacuo and work-up procedures were carried out after removal of 
residual solids by filtration; 
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(ii) operations vere carried out at room temperature ^ that 
is in the range 18-25**C and under an atmosphere of an inert gas such 
as argon; 

(iii) column chromatography (by the flash procedure) and 
medium pressure liquid chromatography (MPLC) vere performed on Merck 
Kieselgel silica (Art. 9385) or Merck Lichroprep RP-IS (Art. 9303) 
reversed-phase silica obtained from E. Merck, Darmstadt, Germany; 

(iv) yields are given for illustration only and are not 
necessarily the maximum attainable; 

(v) the end-products of the Formula I have satisfactory 
microanalyses and their structures vere confirmed by nuclear magnetic 
resonance (NMR) and mass spectral techniques; unless othervise stated, 
CDClj solutions of the end-products of the Formula I vere used for the 
determination of NMR spectral data, chemical shift values vere 
measured on the delta scale; the folloving abbreviations have been 
used: s, singlet; d, doublet; t, triplet; m, multiplet; 

(vi) intermediates vere not generally fully characterised 
and purity vas assessed by thin layer chromatographic, infra-red (IR) 
or NMR analysis; 

(vii) melting points are uncorrected and vere determined 
using a Mettler SP62 automatic melting point apparatus or an oil-bath 
apparatus; melting points for the end-products of the formula I vere 
determined after crystallisation from a conventional organic solvent 
such as ethanol, methanol, acetone, ether or hexane, alone or in 
admixture and; 

(viii) all temperatures are in ^C. 

A brief description of the Figures is set out belov. 
Figure 1 illustrates pancreatic HCFB cloning. 
Figure 2 illustrates pancreatic HCFB sequencing. 
Figure 3 Illustrates vector pICI1266. 

Figure 4 Illustrates pICI1266 expression vector gene cloning. 

Figure 5 illustrates cytotoxicity of a prodrug and corresponding drug.. 

Figure 6 lists the composition of a grovth medium. 

Figures 7-14 illustrate chemical S5mthetlc procedures. 

Figure 15 shovs cjrtotoxlclty of the prodrug of Example 21 alone and 

corresponding drug of Example 22 alone in LoVo tumour cells. 
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Figure 16 shows cytotoxicity of the prodrug of Example 21 in the 
presence of the mutated enzyme, D253K HCPB in LoVo tumour cells. 
Numbered rows represent the following: 1^ blank (no cells); 2-4b the 
drug of Example 22 at 50, 100 & 200mM respectively; 5-8= the prodrug 
of Example 21 in the presence of 1.47, 2.4, 5.9 & 11.75 ug/ml of D253K 
HCPB respectively; 9= the prodrug of Example 21 at 500pH; and 10» 
control (cells only). Each numbered row contains 2 bars (with margin 
of error indicated) wherein each bar represents data from 6 wells on a 
plate . 

Figure 17 illustrates a chemical synthesis. 
Reference Example 1 

Synthesis of Hippuryl-L-Glutamic Acid (see Figure 8) 

Hippuryl-L-glutamic acid dibenzyl ester (compound 3) {2.06g, 

4.2x10"^ moles) and 20% Pd/Carbon {50% moist) (0.77g) in THF were 

stirred in an atmosphere of hydrogen for 1.5 hours. The mixture was 

TM 

filtered through diatomaceous silica (Celite ) and the filtrate 
evaporated to dryness. Trituration with diethyl ether gave the 
desired end product as a white crystalline solid 1.02g (78%). Melting 
point 169-171^C. 20D « -2.5*» 

NMR DMSO d6 12.3, 2H (broad); 8.7, IH (t); 8.2 ,1H (t); 7.9, 2H (m); 
7.5, 3H (m); 4.3, IH (m); 3.9, 2H (m); 2.3, 2H (t); 1.9, 2H (m) 

The starting material compound 3 was prepared as follows. 
To a solution of hippuric acid (0.90g, 5x10 moles) and L-glutamic 
acid dibenzyl ester (2.50g, 5x10"^ moles) in DMF (35ml) was added 
l-hydroxybenzotriazole (0.73g, 5.5x10 moles), triethylamine (1.4ml, 
9.7x10""^ moles) and l(3-dimethyl-aminopropyl)-3-ethylcarbodiimide, HCl 
salt (1.05g, 5.5x10'^ moles). The mixture was stirred overnight at 
room temperature, poured into water (400ml) and extracted twice with 
ethyl acetate (100ml). The combined extracts were washed with 
saturated sodium bicarbonate solution, water, 2N HCl and water. The 
organic phase was dried over MgSO^ and evaporated to obtain the 
desired starting material as a yellow oil. 2.06g (84%). 
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NMR DMSO d6 8.7, IH (t); 8.4, IH (d); 7.9, 2H (m); 7.5, 3H (m); 
7.35, lOH (m); 5.15, 2H (s); 5.05, 2H (s); 4.4, IH (m); 3.9, 2H (t); 
2.0, 4H (m). 

Reference Example 2 

Synthesis of Hlppuryl-L*Aspartic acid 

-3 

Hippuryl-L-aspartlc acid dibenzyl ester (1.28g , 2.7x10 

moles) and 30X Pd/Carbon (50% moist) (0.51g) in THF were stirred in an 

atmosphere of hydrogen for 3 hours. The mixture vas filtered through 
TM 

Celite and the filtrate evaporated to drjmess. Trituration with 
diethyl ether gave an off-white crystalline solid 0.62g (76%). 
Melting point 200-202*^0. 20D » + 7.9* 

NMR DMSO d6 12-5, 2H (broad); 8.7, IH (t); 8.2, IH (d); 7.7 ,2H 

(m); 7.5, 3H (m); 4.6, IH (m) ; 3.9, 2H (d); 2.7, 2H (m) 

The starting material was synthesised as follows. To a 

,3 

solution of hippuric acid (0.90g, 5x10 moles) and L-aspartic acid 

dibenzyl ester (2.31g, 5x10 moles) in DMF (35ml) was added 

.3 

l-hydroxybenzotriazole (0.73g, 5.5x10 moles), triethylamine (1.4ml, 
.3 

9.7x10 moles) and l-(3-dimethyl-aminopropyl)-3-ethylcarbodiimide, 

-3 

HCl salt (1.05g, 5.5x10 moles). The mixture was stirred for 4 hours 
at room temperature then poured into water (450ml) and extracted twice 
with ethyl acetate (100ml). The extract was washed with saturated 
sodium bicarbonate solution, water, 2N HCl and water. The organic 
phase was dried over MgSO^ and evaporated to dryness to obtain the 
desired starting material as a yellow oil. 1.90g (80%) 
NMR DMSO d6 8.7, IH, (t); 8.45, IH, (d); 7.9, 2H (m); 7.5, 3H (m); 
7.3, lOH (m); 5.15, 2H (s); 5.05, 2H (s); 4.8, IH (m); 3.9, 2H (m); 
2.9, 2H (m) 
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Reference Example 3 

EnzyDic activity of recombinant HCPB against Hipp-Arg. 

Purified human CPB, produced as described in Reference 
Example 12, was assayed for its ability to convert hippuryl-L-arginine 
(Hipp-Arg; Sigma) to hippuric acid using a spectrophotometric assay* 

The Km and kcat for native HCPB were determined by measuring 
the initial rate of conversion of Hipp-Arg to hippuric acid at 254 nH 
using a range of Hipp-Arg concentrations (0. 75-0 • 125 mM ) and a CPB 
enzyme concentration of lyg/ml. Measurements were carried out at 37**C 
in 0.25 mM Tris HCl buffer, pH 7,5 using 1 cm path length cuvettes in 
a total volume of 1.0 ml using a Perkin Elmer Lambda 2 
spectrophotometer. Km and Vmax values were calculated using the 
ENZFITTER software programme (Biosoft, Perkin Elmer). Kcat was 
calculated from Vmax by dividing by the enzsme concentration in the 
reaction mixture. 

The results for human CPB against Hipp-Arg were: 
Km B 0.18 mM 
kcat « 65 s"^ 

The results demonstrate that the recombinant HCPB is 
enzymatically active and can cleave the amide bond in Hipp-Arg to 
release Hippuric acid. 

Reference Example 4 

Sjmthesis of an Arginine mustard prodrug (see Figure 7) 

(2S) ,2-(3-{4- [bis- (2-chloroethyl) -amino) -phenoxycarbonyl) -propionyl- 

amino)-5-guanidino-pentoic acid (compound 5c, Figure 7) 

A solution of (2S),2-(3-{4-[bis-(2-chloroethyl)-amino)- 
phenoxycarbonyl)-propionyl-amino)-5-(2-nitro)-guanidino-pentoic acid 
benzyl ester (compound 4c, Figure 7) (275 mg; 0.44 mmol) in ethyl 
acetate/MeOH (1/1: V/V) (8 ml) containing 10 % Pd/C (200 mg) was 
hydrogenated in a Paar apparatus at 80 psi for 6 h. After filtration 
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the organic layer was evaporated* The resulting oil vas recrystallised 
using CH2CI2 /diethyl ether to give the desired compound 5c as a vhite 
solid (180 mg), yield 842. 

^HNMR (CDjOD): 1.55-1-7 (m, 3H); 1.8-1.9 (m, IH); 2.6-2.7 (m, 2H); 

2.75-2.85 (m, IH); 2.9-2.95 (m, IH); 3.1-3.2 (m, 2H); 3.6-3.7 (in, 4H); 

3.7-3.8 (m, 4H); 4.3 (dd, IH); 6.75 (dd, 2H); 6.95 (dd, 2H). 

MS (ESI) : 512-514 (MNa)+ 

Anal (C2oH29N50^Cl2 1.5 H2O) 

Calc. C: 47.91 Hi 6.43 N: 13.97 

Found C: 47.7 H: 6.21 Ni 14.26 

Starting material compound 4c vas prepared as follows. To a 
solution of (2S),2-araino-5-(2-nitro)-guanidino-pentoic acid benzyl 
ester (compound 2c) (654 mg; 1 mmol) in CHCl^ (10 ml) vas added 
dihydro-furan-2,5-dione (compound 1) (120 mg; 2 mmol) folloved by 
triethylamine (202 mg; 2 mmol) dropvise. After stirring for 2h at 
room temperature, the solvent vas evaporated and the crude residue vas 
dissolved in vater. pH vas adjusted to 2.5 vith 2N HCl. The aqueous 
layer vas extracted vith ethyl acetate. The organic layer vas vashed 
vith brine, dried (MgSO^) and evaporated to give 

(2S ) , 2- ( 3-carboxy-propionylamino ) -5- ( 2-nitro ) -guanidino-pentoic acid 
benzyl ester (compound 3c). The resulting solid vas triturated vith 
diethylether and filtered off: 280 mg (68 X). 

IHNMR (CD30D): 1.52-1.68 (m, 2H); 1.7-1.8 (m, IH); 1.85-1.95 (m, IH); 
2.45-2.7 (m, 4H); 3.15-3.3 (m, 2H); 4.5 (m, IH); 5.15 (dd, 2H); 
7.25-7.4 (m, 5H) 
MS (ESI): 432 [MNa]+ 

To a suspension of compound 3c (204 mg; 0.5 mmol) in CHCl^ 
(5 ml) vas added 4- (bis (2-chloroethyl) amino) -phenol (compound 6) (135 
mg; 0.5 mmol), EDCI (19 mg; 0.5 mmol) folloved by DMAP (18 mg; 0.75 
mmol). After stirring at room temperature for 6h, the solvent vas 
evaporated. The residue vas partitioned betveen ethyl acetate and 
vater and the aqueous phase acidifed to pH « 3 vith 2N HCl. After 
extraction vith ethyl acetate, the organic layer vas vashed vith 
brine, dried (MgSO,) and evaporated. The residue vas purified by flash 
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chromatography using CH2Cl2/MeOH (95/5: V/V) as eluant to give the 
desired starting material 4c as a white foam (281 mg) yield: 90 X. 
4c: ^HNHR ((3)300): 1.55-1.7 (m, 2H); 1.7-1.8 (m, IH); 1.85-1.95 
(m, IH); 2.55-2.75 (m, 2H); 2.8-2.9 (m, 2H); 3.15-3.25 (m, 2H); 
3.6-3.7 (m, 4H); 3.7-3.8 (m, 4H); 4.5 (dd, IH); 5.15 (dd, 2H); 6.7 
(d, 2H); 6.95 (d, 2H); 7.32 (m, 5H) 
MS (ESI) : 647-649 [MNaJ+ 

Keference Example 5 

Synthesis of succinic acid mono-{4-[NyN-bis(2-chloroethyl)amino]"* 
-phenyl} ester (also called "intermediate" herein) 

To a suspension of succinic anhydride (225mg, 2.25mmol) in 
CHClj (10ml) was added under stirring, 4-[N,N-bis(2-chloroethyl)- 
amino] phenol (compound 6, Figure 7; 203mg, 0.75 mmol) followed by 
triethylamine (75mg, 0.75 mmol). The mixture was stirred overnight and 
the solvent evaporated. The crude residue was dissolved in 
Et0AC/Et20/H20 and under stirring the pH was adjusted to 3. The 
organic layer was washed with water, brine, dried (HgSO^), and 
evaporated. The resulting oil was crystallised from Et20/hexane and 
the white solid was filtered off and dried under vacuum to obtain the 
desired end product (210 mg; yield 83%). Melting point 98-100**C. 
MS (ESI): 356-358 [MNaj"*" 

NMR (CDCI3): 2.8 (dd, 2H); 2.9 (dd,2H); 3.65 (dd, 4H); 3.75 
(dd, 4H); 6.65 (d, 2H); 7.0 (d, 2H) 
Analysis (Cj^Hj^Cl20^N 0.2 H2O) : 
Calc. %C: 49.78 H: 5.19 N: 4.15 
Found %C: 49.9 H: 5.3 N: 4.2 

Reference Example 6 

Cloning of human pancreatic carboxypeptidase B (HCPB) 

Standard molecular biology techniques, such as restriction 
enzyme digestion, ligation, kinase reactions, dephosphorylation, 
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polymerase chain reaction (FCR), bacterial transformations, gel 
electrophoresis! buffer preparation and DNA generation, purification 
and isolation, vere carried out as described by Maniatis et al., 
(1989) Molecular Cloning , A Laboratory Manual; Second edition: Cold 
Spring Harbor Laboratory, Cold Spring Harbor, Nev York, or following 
the recommended procedures of manufacturers of specific products. In 
most cases enzymes vere purchased from Nev England BioLabs, but other 
suppliers, and equivalent procedures may be used. Oligonucleotide 
sequences vere prepared in an Applied Biosystems 380A DNA synthesiser 
from 5'dimethox5rtrityl base-protected nucleoside-2-cyanoethyl-N,N'-di- 
isopropyl-phosphoramldltes and protected nucleoside linked to 
controlled-pore glass supports on a 0.2 lomol scale, according to the 
protocols supplied by Applied Biosystems Inc.. 

The coding sequence for human pancreatic carboxypeptidase B 
vas obtained from a human pancreatic cDNA library cloned in the XgtlO 
vector (Clontech, Human pancreas 5' STRETCH cDNA, HL1163a) using FCR 
technology, and cloned into the plasmld vector pBluescript II KS-i- 
(Stratagene). 

Tjrpically, an aliquot of the cDNA library (5ul at a titre of 

o 

>10 pfu/ml) vas mixed vlth IpOpHols of tvo oligonucleotide primers, 
BPTl and BPBl, (SEQ ID NO: 28 and SEQ ID NO: 29), dNTFs to a final 
concentration of 200yH, Taq polymerase reaction buffer, and 2 .51) of 
Taq polymerase in a final volume of IOOmI. The mixture vas heated at 
94*'C for 10 minutes prior to addition to the Taq enzyme, and the FCR 
incubation vas carried out using 30 cycles of 94''C for 1.5 minutes, 
50**C for 2 minutes, and 72**C for 2 minutes, folloved by a single 
incubation of 72'*C for 9.9 minutes at the end of the reaction. 

The tvo oligonucleotide primers vere designed to allov FCR 
extension from the 5' of the gene from BFTl (SEQ ID NO: 28), betveen 
the start of the pre-sequence and the start of the pro-sequence, and 
FCR extension back from the 3' end of the gene from BFB1(SEQ ID NO: 
29), as shovn In Figure 1. BPTl and BPBl are also designed to 
introduce unique restriction sites. Sad and Xhol respectively, into 
the FCR product. 

An aliquot of the FCR product vas analysed for DNA of the 
correct size (about 1250 base pairs) by agarose gel electrophoresis 
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and found to contain predominantly a band of the correct size. The 
remainder of the product from the reaction mix vas purified and 
separated from excess reagents using a Centricon 100 microconcentrator 
column (Amicon), folloved by DNA isolation by ethanol/sodiuro acetate 
precipitation » centrifugation, vacuum drying and re-suspension in 
distilled water • The isolated DNA vas restriction digested with 
enzymes Sad and Xhol, and a band of the correct size (about 1250 base 
pairs) purified and isolated from agarose gel electrophoresis using 
excision and glass-milk (Geneclean, Stratec Scientific , or other 
similar product). 

pBluescript II KS-i- double stranded DNA (Stratagene) vas 
restriction digested vith Sad enzyme^ and the product 
dephosphorylation treated vith calf intestinal alkaline phosphatase to 
remove 5'phosphoryl groups and reduce re-ligation and vector 
background following transformation* The DNA product vas purified 
from enzyme reaction contaminants using glass-milk, and then 
restriction digested vith Xhol enzyme. DNA of the correct size (about 
2850 base pairs) vas purified and isolated by agarose gel 
electrophoresis using excision and glass-milk (Geneclean, Stratec 
Scientific, or other similar product). 

Aliquots of both restricted and purified DNA samples vere 
checked for purity and concentration estimation using agarose gel 
electrophoresis compared vith knovn standards. From these estimates 
ligation mixes vere prepared to clone the HCFB gene into the vector, 
using a molar ratio of about 1 vector to 2.5 insert (1 pBluescript II 
KS+ to 2.5 HCFB PGR product), and a final DNA concentration of about 
2.5ng/vil, in the presence of T4 DNA ligase, ImH ATP and enz3rme buffer. 

Folloving the ligation reaction the DNA mixture vas used to 
transform E.coli strain DH5a (Gibco-BRL, maximum efficiency competent 
cells). Cell aliquots vere plated on L-agar nutrient media containing 
lOOyg/ml amplcillin as selection for plasmid vector, and incubated 
over-night at 37*0. Colonies containing plasmids vith inserts of 
interest vere identified by hybridisation. 

About 200 colonies vere picked and plated onto duplicate 
sterile nitro-cellulose filters (Schleicher and Schull), pre-vet on 
plates of L-agar nutrient media containing lOOyg/ml amplcillin as 
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selection for plasmid vector^ and incubated over-night at 37**C. One 
duplicate plate is stored at A^^C, and acts as a source of live cells 
for the colonies, the other plate is treated to denature and fix the 
DNA from the individual colonies to the nitro-cellulose. The 
nitro-cellulose filter is removed from the agar plate and placed in 
succession onto Whatman filter papers soaked in : 

1. 102 SDS for 2 minutes 

2. 0,5M NaOH, 1,5H NaCl for 7 minutes 

3. 0.5H NaOH, 1-5M NaCl for 4 minutes 

4. 0.5M NaOH, 1«5M NaCl for 2 minutes 

5. 0*5M Tris pH7.A, 1.5M NaCl for 2 minutes 

6. 2xSSC (standard saline citrate) for 2 minutes. 

The filter is then placed on a Vhatman filter paper soaked 
in lOxSSC and the denatured DNA is crossed linked to the 
nitro-cellulose by ultra violet light treatment (Spectrolinker XL- 1500 
UV crosslinker) • The filters are then allowed to air dry at room 
temperature, and are then pre-hybridised at 60^C for one hour in a 
solution of 6xSSC with gentle agitation (for example using a Techne 
HB-ID hybridizer). Pre-hybridization blocks non-specific DNA binding 
sites on the filters. 

In order to determine vhich colonies contain DNA inserts of 

interest the DNA crosslinked to the nitro-cellulose filter is 

32 

hybridised with a radio-labelled P-DNA probe prepared from HCPB PCR 
product of the pancreatic cDNA library (see above). About 50ng of DNA 
was labelled with SOpCi of ^^P-dCTP (-3000Ci/raMol) using T7 DNA 
polymerase in a total volume of 50ul (Pharmacia T7 Quickprime kit), 
and the reaction allowed to proceed for 15 minutes at 37**C. The 
labelled probe is then heated to 95''C for 2 minutes, to denature the 
double stranded DNA, immediately added to 10ml of 6xSSC at 60^C, and 
this solution used to replace the pre-hybridisation solution on the 
filters. Incubation with gentle agitation is continued for about 3 
hours at SO'^C. After this time the hybridisation solution is drained 
off, and the filters washed twice at 60**C in 2xSSC for 15 minutes each 
time. Filters were then gently blotted dry, covered with cling film 
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(Saran wrap or similar), and exposed against X-ray film (for example 
Kodak Xomat-AR5 ) over-night at room temperature. Folloving 
development of the film, colonies containing inserts of interest were 
identified as those which gave the strongest exposure (darkest spots) 
on the X-ray film. In this series of experiments about 15% of the 
colonies gave positive hybridisation. From this 12 colonies were 
chosen for further screening. These colonies were picked from the 
duplicate filter, streaked and maintained on L-agar nutrient media 
containing lOOpg/ml ampicillin, and grown in L-broth nutrient media 
containing lOOpg/ml ampicillin. 

The selected isolates were checked by PCR for inserts of the 
correct size, using primers BPTl and BPBl, (SEQ ID NO: 28 and SEQ ID 
NO: 29), and for priming with an internal primer BPT2 (SEQ ID NO: 30) 
and BPBl. BPT2 is designed to prime at the end of the pro-sequence, 
prior to the start of the mature gene and to introduce an Xbal 
restriction site. 

For PCR screening colonies of the selected isolates were 
picked and dispersed into 200m1 of distilled water and heated at 100*'C 
for 10 minutes in a sealed Ependorph tube. The suspensions were then 
centrifuged for 10 minutes in a microfuge to pellet cell debris, and 
1m1 of the supernatant used as the DNA template in PCR screening. 
Typically, Ipl of supernatant was mixed with 20pMols of two 
oligonucleotide primers, BPTl and BPBl, or BPT2 and BPBl, dNTPs to a 
final concentration of 200mM, Taq polymerase reaction buffer, and 0.5U 
of Taq polymerase in a final volume of 20yl. The PCR incubation was 
carried out using 25 cycles of 9A*»C for 1.5 minutes, 50**C for 2 
minutes, and 72**C for 2 minutes, followed by a single incubation of 
72*C for 9.9 minutes at the end of the reaction. 

The PCR products were analysed for DNA of the correct size 
(about 1250 base pairs from primers BPTl to BPBl, and about 900 base 
pairs from primers BPT2 to BPBl, see Figure 1) by agarose gel 
electrophoresis. Ten of the twelve clones gave PCR DNA products of 
the correct size. Six of the ten clones were then taken for plasroid 
DNA preparation (using Qiagen Maxi kits, from 100ml of over-night 
culture at 2T*C in L-broth with lOOvig/ml ampicillin). These plasmid 
DNA preparations were then sequenced over the region of PCR product 
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insert using an USB Sequenase DNA sequencing kit, which incorporates 
bacteriophage T7 DNA polymerase. Each clone was sequenced using eight 
separate oligonucleotide primers, known as 676, 336, 337, 679, 677, 
1280, 1279 and 1281 (SEQ ID NOs: 30 to 37). The positioning of the 
sequencing primers within the HCPB sequence is shown diagramatically 
in Figure 2, primers 336, 1279, 676, 1280, 677 and 1281 being 
'forward', and 337 and 679 'backwards'. 

Five of the six clones were found to have identical sequence 
(SEQ ID NO: 38) of 1263 base pairs between and including the Sad and 
Xhol restriction sites, and this sequence was used in further 
experiments. The translation of the DNA sequence into its poljrpeptide 
sequence is shown in SEQ ID NO: 39. The start of the mature protein 
sequence is amino acid residue 109. Amino acid numbered 14 marks the 
start of the putative pro-enz3nne sequence. Only part of the enzyme 
secretion leader sequence (pre-sequence) is present in the cloned PCR 
generated DNA. The polypeptide sequence shows an aspartate residue at 
position 361, which when the whole sequence is aligned with other 
mammalian carboxypeptidase A and B sequences indicates a B type 
specificity (see amino acids numbered 255 by Catasus L, et al, Biochero 
J., 287, 299-303, 1992, and discussion). However, the cysteine 
residue at position 243 in the cloned sequence is not observed in 
other published human pancreatic carboxypeptidase B sequences, as 
highlighted by Yamamoto et al, in the Journal of Biological Chemistry, 
v267, 2575-2581, 1992, where she shows a gap in her sequence following 
the position numbered 244, when aligned with other mammalian 
pancreatic carboxypeptidase B amino acid sequences. Also shown on 
Figure 2 are the approximate sites of the aspartate amino acid residue 
in the enzyme recognition site, and the cysteine residue at position 
135 of the mature enzyme (position 243 in SEQ ID NO: 39). 

One of the clones was deposited on 23-November-1994 with the 
National Collection of Industrial and Marine Bacteria Limited (23 St. 
Hachar Drive, Aberdeen AB2 IRY, Scotland) and has the designation 
NCIHB 40694. The plasmid from this clone is known as pICI1698. 
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Reference Example 7 

Expression of mature HCPB-(His)g-c-M7c from E. coli 

In order to achieve the expression of mature HCPB from 
E.coli the mature gene from pICI1698 vas transferred into a plasmid 
vector which allows controlled secretion of protein products into the 
periplasm of the bacteria • This secretion vector^ known as pICI266, 
in a bacterial host MSD522 suitable for controlled expression, has 
been deposited on 11 October 1993 with the National Collection of 
Industrial and Marine Bacteria Limited (Aberdeen AB2 IRY, Scotland) 
and has the designation NCIHB 40589* A plasmid map of pICI266 is 
shown in Figure 3. The plasmid has genes for tetracycline resistance 
and induction (TetA and TetR), an AraB operator and promoter sequence 
for inserted gene expression, and an AraC gene for expression control. 
The promoter sequence is followed by the PelB translation leader 
sequence which directs the polypeptide sequence following it to the 
periplasm. The site of gene cloning has several unique restriction 
sites and is followed by a phage T4 transcription terminator sequence. 
The DNA sequence in this region and the features for gene cloning are 
shown diagramatlcally in Figure 4. 

For the cloning of the mature HCPB sequence into pICI266 it 
was decided to generate HCPB DNA by PCR, and to make some alterations 
to the codon usage at the start of the mature gene to introduce E.coli 
preferred codons. Also, to help with detection and purification of 
the expression construct a C-term peptide tag, knoxm as (His)g-c-myc 
was added to the enzyme. The tag consists of 6 histidines, a 
tri-peptide linker (EPE) and a peptide sequence (EQKLISEEDL) from 
c-myc which is recognised by the antibody 9E10 (as published by Evan 
et al, Mol Cell Biol, v5, 129-136, 1985, and available from Cambridge 
Research Biochemicals and other antibody suppliers). The C-term is 
completed by the addition of an Asparagine. The 6 histidine residues 
should allow the purification of the expressed protein on a metal 
chelate column (for example Ni-NTA Agarose from Qiagen). In addition 
the PCR primers are used to introduce unique restriction sites at the 
5' (Fspl) and 3' (EcoRI) of the gene to facilitate the introduction of 
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the PCR product into the expression vector. The sequence of the tvo 
primers, known as FSFTSl and 6HIS9E10R1BS1, are shovn in SEQ ID NOs: 
40 and 41. 

To generate a modified gene for cloning into pICI266, PCRs 
were set up using lOOpMols of primers FSPTSl and 6HIS9E10R1BS1 in the 
presence of approximately 5ng of pICI1698 DNA, dNTPs to a final 
concentration of 200jiM, Taq polymerase reaction buffer, and 2.5U of 
Taq polymerase in a final volume of 100^1 • The mixture was heated at 
94*C for 10 minutes prior to addition to the Taq enzyme, and the PCR 
incubation was carried out using 30 cycles of 94*^0 for 1.5 minutes, 
SO^'C for 2 minutes, and 72**C for 2 minutes, followed by a single 
incubation of 72**C for 9.9 minutes at the end of the reaction. An 
aliquot of the PCR product was analysed for DNA of the correct size 
(about 1000 base pairs) by agarose gel electrophoresis and found to 
contain predominantly a band of the correct size. The remainder of 
the product from the reaction mix was purified and separated from 
excess reagents using a Centricon 100 microconcentrator column 
(Amicon), followed by DNA isolation by ethanol/sodium acetate 
precipitation, centrifugation, vacuum drying and re-suspension in 
distilled water. The isolated DNA was restriction digested with 
enzymes Fspl and EcoRI, and a band of the correct size (about 1000 
base pairs) purified and isolated from agarose gel electrophoresis 
using excision and glass-milk (Geneclean, Stratec Scientific, or other 
similar product). 

pICI266 double stranded DNA, prepared using standard DNA 
technology (Qiagen plasmid kits or similar), was restriction digested 
with Kpnl enzyme, being very careful to ensure complete digestion. 
The enzyme was then inactivated by heating at 65**C for 10 minutes, and 
then cooling on ice. The 3' over-hang generated by the Kpnl was then 
enzymatically digested by the addition of T4 DNA polymerase as 
recommended by the supplier (New England BioLabs), in the presence of 
dNTPs and incubation at le^C for 15 minutes. The reaction was stopped 
by inactivating the enzyme by heating at 70**C for 15 minutes. The DNA 
product was purified from enzyme reaction contaminants using 
glass-milk, an aliquot checked for yield by agarose gel 
electrophoresis, and the remainder restriction digested with EcoRI 
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enzyme. Again care was taken to ensure complete restriction digest. 
DNA of the correct size (about 5600 base pairs) was purified and 
isolated by agarose gel electrophoresis using excision and glass-milk 
(Geneclean, Stratec Scientific, or other similar product). 

Aliquots of both restricted and purified DNA samples were 
checked for purity and concentration estimation using agarose gel 
electrophoresis compared with known standards. From these estimates 
ligation mixes were prepared to clone the HCPB gene into the vector, 
using a molar ratio of about 1 vector to 2.5 insert (1 pICI266 to 2.5 
HCPB PCR product), and a final DNA concentration of about 2.5ng/ul, in 
the presence of T4 DNA ligase, ImM ATP and enzyme buffer, using 
conditions suitable for the ligation of blunt ended DNA (Fspl to T4 
DNA polymerase treated Kpnl). 

Following the ligation reaction the DNA mixture was used to 
transform E.coli strain DHSa (Gibco-BRL, maximum efficiency competent 
cells). Cell aliquots were plated on L-agar nutrient media containing 
lOyg/ml tetracycline as selection for plasmid vector, and incubated 
over-night at 37 "^C. Colonies containing plasmids with inserts of 
interest were identified by hybridisation. 

About 350 colonies were picked and plated onto duplicate 
sterile nitro-cellulose filters (Schleicher and Schull), pre-wet on 
plates of L-agar nutrient media containing lOyg/ml tetracycline as 
selection for plasmid vector, and incubated over-night at 37**C. One 
duplicate plate is stored at A'^C, and acts as a source of live cells 
for the colonies, the other plate is treated to denature and fix the 
DNA from the individual colonies to the nitro-cellulose. The 
nitro-cellulose filter is removed from the agar plate and placed in 
succession onto Uhatman filter papers soaked in : 

1. 10^ SDS for 2 minutes 

2. 0.5M NaOH, 1.5M NaCl for 7 minutes 

3. 0.5H NaOH, 1.5M NaCl for 4 minutes 

4. 0.5M NaOH, 1.5M NaCl for 2 minutes 

5. 0.5M Tris pH7.4, 1.5M NaCl for 2 minutes 

6. 2xSSC (standard saline citrate) for 2 minutes. 
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The filter is then placed on a Whatman filter paper soaked 
in lOxSSC and the denatured DNA is crossed linked to the 
nitro-cellulose by ultra violet light treatment (Spectrolinker XL- 1500 
UV crosslinker). The filters are then allowed to air dry at room 
temperature, and are then pre-hybridised at 60*^C for one hour in a 
solution of 6xSSC with gentle agitation (for example using a Techne 
HB-ID hybridizer), Pre-hybridization blocks non-specific DNA binding 
sites on the filters. 

In order to determine which colonies contain DNA inserts of 
interest, the DNA crosslinked to the nitro-cellulose filter is 
hybridised with a radio-labelled 32P-DNA probe prepared from HCPB PCR 
product of the pancreatic cDNA library (see above). About 50ng of DNA 
was labelled with SOpCi of ^^P-dCTP (-3000Ci/mMol) using T7 DNA 
polymerase in a total volume of 50]j1 (Pharmacia T7 Quickprime kit), 
and the reaction allowed to proceed for 15 minutes at 37**C. The 
labelled probe is then heated to 95**C for 2 minutes, to denature the 
double stranded DNA, immediately added to 10ml of 6xSSC at 60**C, and 
this solution used to replace the pre-hybridisation solution on the 
filters. Incubation with gentle agitation is continued for about 3 
hours at 60*C. After this time the hybridisation solution is drained 
off, and the filters washed twice at 60**C in 2xSSC for 15 minutes each 
time. Filters were then gently blotted dry, covered with cling film 
(Saran wrap or similar), and exposed against X-ray film (for example 
Kodak Xomat-ARS) over-night at room temperature. Following 
development of the film, colonies containing inserts of interest were 
identified as those which gave the strongest exposure (darkest spots) 
on the X-ray film. In this series of experiments about SOX of the 
colonies gave positive hybridisation. From this 12 colonies were 
chosen for further screening. These colonies were picked from the 
duplicate filter, streaked and maintained on L-agar nutrient media 
containing lOyg/ml tetracycline, and grown in L-broth nutrient media 
containing lOug/ml tetracycline. 

The selected isolates were checked by PCR for inserts of the 
correct size, using primers FSPTSl and 6HIS9E10R1BS1, (SEQ ID NO: 40 
and SEQ ID NO: 41), and for priming with an internal primer BPB2 (SEQ 
ID NO: 33) and FSPTl. BPB2 is designed to prime within the mature 
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gene and generate a fragment of about 430 base pairs. 

For PCR screening colonies of the selected isolates were 
picked and dispersed into 200ul of distilled water and heated at 100**C 
for 10 minutes in a sealed tube. The suspensions were then 
centrifuged for 10 minutes in a microfuge to pellet cell debris, and 
lyl of the supernatant used as the DNA template in PCR screening* 
Typically, 1^1 of supernatant was mixed with 20pMols of two 
oligonucleotide primers, FSPTl and 6HIS9E10R1BS1, or FSPTl and BPB2, 
dNTPs to a final concentration of 200mM, Taq polymerase reaction 
buffer, and 0.5U of Taq polymerase in a final volume of 20^1. The PCR 
incubation was carried out using 25 cycles of 94*'C for 1.5 minutes, 
50*»C for 2 minutes, and 72'^C for 2 minutes, followed by a single 
incubation of 72^C for 9.9 minutes at the end of the reaction. 

The PCR products were analysed for DNA of the correct size 
(about 1000 base pairs from primers FSPTSl to 6HIS9E10R1BS1, and about 
430 base pairs from primers FSPTSl to BPB2) by agarose gel 
electrophoresis. All twelve clones gave PCR DNA products of the 
correct size. Six of the clones were then taken for plasmid DNA 
preparation (using Qiagen Maxi kits, from 100ml of over-night culture 
at 37**C in L-broth with lOpg/ml tetracycline). These plasmid DNA 
preparations were then sequenced over the region of PCR product insert 
using an USB Sequenase DNA sequencing kit, which incorporates 
bacteriophage T7 DNA polymerase. Alternatively the DNA was sequenced 
using an automated DNA sequencing service (using ABI sequencing 
equipment). The clones were sequenced using several separate 
oligonucleotide primers. Three of the primers, known as 1504, 1590 
and 1731, were used to check the cloning junctions between the 
expression vector and the inserted gene (SEQ ID NOs: 42, 43 and 44), 
as well as giving sequence data from the start and end of the inserted 
gene. Other primers, including those known as 679, 677, 1802, and 
1280 (SEQ ID NOs: 33, 34, 45 and 35) were used to confirm the 
remainder of the inserted gene sequence. This plasmid containing the 
modified mature HCPB gene is known as pICI1712. The confirmed 
sequence of the cloned gene, showing amino acid translation, from the 
start of the PelB sequence to the end of the (His)g-c-myc tag is shown 
as SEQ ID NO: 46 with DNA numbering starting from 1 in the first codon 
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of PelB, and peptide numbering starting from 1 in the mature HCPB, 
To obtain controlled expression of the modified HCPB the 
pICI1712 plasmid DNA was transformed into calcium chloride 
transformation competent E.coli expression strains. Included amongst 
these strains were a number which were incapable of growing with 
arabinose as the major carbon source, and were chromosome deleted for 
the arabinose (Ara) operon. A preferred strain is known as MSD213 
(strain MClOOO of Casadaban et al, Journal of Molecular Biology, 138 , 
179-208, 1980), and has the partial genotype, F" Ara A(Ara-Leu) 
ALacX74 GalV GalK StrR, Another preferred strain is known as MSD525 
(strain MC1061) and has the genotype, AraD139 A(Ara Leu)7697 AUc74 
GalU HsdR RpsL. E.coli strains of similar genotype, suitable for 
controlled expression of genes from the AraB promoter in plasmid 
pICI266, may be obtained from The E.coli Genetic Stock Centre, 
Department of Biology, Yale University, CT, USA. Selection for 
transformation was on L-agar nutrient media containing lOyg/ml 
tetracycline, over night at 37**C. Single colonies were picked from 
the transformation plates, purified by streaking and maintained on 
L-agar nutrient media containing lOyg/ml tetracycline, and grown in 
L-broth nutrient media containing lOpg/ml tetracycline. 

All pICI1712 transformed expression strains were treated in 
the same manner to test for expression of the cloned HCPB gene. 

1. A single colony was used to Inoculate 10ml of L-broth nutrient 
media containing lOyg/ml tetracycline In a 25ml Universal container, 
and Incubated over night at 37**C with shaking. 

2. 75ml of L-broth nutrient media containing lOug/ml tetracycline 
pre-warmed to 37** C in a 250ml conical flask was Inoculated with 0.75 
ml (IXv/v) of the over-night culture. Incubation was continued at 
37**C with shaking, and growth monitored by light absorbance at 540nm. 
Induction of cloned protein expression was required during exponential 
growth of the culture, and this was taken as between 0.4 and 0.6 O.D. 
at 540nm, and generally took 90 to 150 minutes from Inoculation. 

3. When the cells had reached the required optical density the 
cultures were allowed to cool to approximately 30^C by placing the 
flasks at room temperature for 30 minutes. Arabinose was then added 
to a final concentration of 1% (w/v), and Incubation continued at 30'*C 



SUBSTITUTE SHEET (RULE 26) 



wo 97/07769 



- 45 - 



PCr/GB96/01975 



with shaking for 4 to 6 hours. 

A. After incubation a final optical density measurement is taken, and 
the cells were harvested by centrifugation. The final O.D* 
measurement is used to calculate the the volume of protein acrylamide 
gel (Laemmli) loading buffer that is used to resuspend the cell 
pellet. For O.D. less than 1 a volume of lOul is used for each 0.1 
O.D. unit, and for an 0-D. greater than 1 a volume of 15^1 is used for 
each 0.1 O.D. unit. The Laemmli loading buffer consists of 0.125M 
Tris-HCl pH 6.8, containing 2% SDS, 2% p-mercaptoethanol, lOX glycerol 
and Q.IX Bromophenol blue. 

5. Following re-suspension the samples were denatured by heating at 
lOO'C for 10 minutes, and then centrifuged to separate the viscous 
cell debris from the supernatant. Expression samples, usually 20ul of 
the supernatant, typically were loaded onto 17% SDS acrylamide gels 
for electrophoretic separation of the proteins. Duplicate gels were 
generally prepared so that one could be stained for total protein 
(using Coomassie or similar stain and standard conditions), and the 
other could be processed to indicate specific products using Western 
analysis. 



For Western analysis proteins in the run gel were 
transferred to nylon membrane (Problot, Applied Biosystems for 
example), using a semi-dry electrophoresis blotting apparatus (Bio-rad 
or similar). Before and during processing care was taken to ensure 
that the membrane remained damp. After transfer of the proteins from 
the gel, further binding was blocked with a solution of 5% low fat 
milk powder (Marvel or similar) in phosphate buffered saline (PBS) at 
room temperature with gentle agitation for 5 hours. The membrane was 
then washed 3 times at room temperature with gentle agitation for 5 
minutes each time in PBS containing 0.05Z Tween 20. The washed 
membrane was then incubated with the primary antibody, monoclonal 9E10 
mouse anti-c-myc peptide (see above), at a suitable dilution 
(typically 1 in 10,000 for ascites or 1 in 40 for hybridoma culture 
supernatant) in PBS containing 0.05% Tween 20 and 0.5% low fat milk 
powder, at room temperature with gentle agitation over night. The 
membrane was then washed 3 times at room temperature with gentle 
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agitation for at least 5 minutes each time in PBS containing 0*05% 
Tveen 20. The vashed membrane vas then incubated vith the secondary 
antibody 9 horseradish peroxidase labelled anti-mouse IgG (typically 
raised in goat, such as A4416 from Sigma) > at a suitable dilution 
(typically 1 in 10,000) in PBS containing 0.05% Tveen 20 and 0.5X lov 
fat milk povder, at room temperature vith gentle agitation for at 
least three hours. The membrane vas then vashed 3 times at room 
temperature vith gentle agitation for at least 10 minutes each time in 
PBS containing Tveen 20. The membrane vas then processed using 

the Aroersham ECL Western detection kit methodology, and exposed 
against Amersham Hyperfilm ECL for 30 seconds in the first instance, 
and then for appropriate times to give a clear image of the expressed 
protein bands. Other methods of similar sensitivity for the detection 
of peroxidase labelled proteins on membranes may be used. 

Good expression of the cloned tagged HCFB in pICI266 
(pICI1712) vas demonstrated in E.coli strains MSD213 and MSD525 by the 
Coomassie stained gels shoving an additional strong protein band at 
about 35,000 Daltons vhen compared to vector (pICI266) alone clones, 
and a band of the same size giving a strong signal by Uestem analysis 
detection of the c-myc peptide tag. 

Reference Example 8 

Expression of mature HCPB from E. coli 

The method of cloning and expressing the mature HCPB in 
E.coli vas very similar to the method described in Reference Example 
7. Again pICI266 vas used as the cloning vector, but in this case the 
starting material for PCR of the mature HCFB gene vas plasmid 
pICI1712, the tagged gene in the expression vector. Tvo 
oligonucleotides, knovn as 2264 and 2265 (SEQ ID NOs: 48 and 49) vere 
used in the PCR reactions (instead of primers FSPTSl and 
6HIS9E10R1BS1) , using similar conditions to Reference Example 7 , but 
using pICI1712 DNA instead of pICI1698. The first, top strand, 
oligonucleotide, 2264, vas designed to prime on pICI1712 and to 
include the Ncol restriction enzyme site in the PelB leader sequence, 
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and to continue to the start of the inserted mature HCPB gene (DNA 
bases 36 to 66 inclusive in SEQ ID NO: 46). The second, bottom 
strand, oligonucleotide, 2265, was designed to prime at the end of the 
mature HCPB gene, prior to the start of the (His)g-c-myc tag sequence 
(complementary to DNA bases 965 to 987 inclusive in SEQ ID NO: 46), 
and to introduce translation termination codons (complementary to TAA 
TAA) at the end of the gene followed by an EcoRI (GAATTC) restriction 
enzyme site and fill-in bases* This oligo primes back into the gene 
in the PCR to isolate the mature gene sequence. 

An aliquot of the PCR product was analysed for DNA of the 
correct size (about 970 base pairs) by agarose gel electrophoresis and 
found to contain predominantly a band of the correct size. The 
remainder of the product from the reaction mix was purified in a 
similar manner to Reference Example 7 . The isolated DNA was 
restriction digested with enzymes Ncol and EcoRI, and a band of the 
correct size (about 940 base pairs) purified in a similar manner to 
Reference Example 7 . 

pICI266 double stranded DNA, prepared in a similar manner to 
Reference Example 7 ^ was restriction digested with Ncol and EcoRI 
enzymes, being very careful to ensure complete digestion. DNA of the 
correct size (about 5600 base pairs) was purified in a similar manner 
to Reference Example 7 . 

Aliquots of both restricted and purified DNA samples were 
checked for purity and concentration estimation using agarose gel 
electrophoresis compared with known standards. From these estimates 
ligation mixes were prepared to clone the HCPB gene into the pICI266 
vector in a similar manner to Reference Example 7 . 

Following the ligation reaction the DNA mixture was used to 
transform E.coli strain DH5a, colonies were picked and tested by 
hybridisation, in a similar manner to Reference Example 7 . 

Six of the clones were then taken for plasmid DNA 
preparation, which were then sequenced over the region of PCR product 
in a similar manner to Reference Example 7 . The clones were sequenced 
using six separate oligonucleotide primers known as 1504, 1802, 679, 
1280, 677 and 1731 (SEQ ID NOs: 42, 45, 33, 35, 34 and 44). From the 
sequencing results a clone containing a plasmid with the required 
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mature HCPB gene sequence was selected^ and is known as pICI1736. 

The confirmed sequence of the cloned gene, shoving amino 
acid translation I from the start of the FelB sequence to the EcoRI 
restriction site is shovn as SEQ ID NO: 50 with DNA numbering starting 
from 1 in the first codon of PelB, and peptide niirobering starting from 
1 in the mature HCPB. 

To obtain controlled expression of the mature HCPB, the 
pICI1736 plasmid DNA was transformed into calcium chloride 
transformation competent E.coli expression strains in a similar manner 
to Reference Example 7 . All pICI1736 transformed expression strains 
were treated in a similar manner to Reference Example 7 to test for 
expression of the cloned HCPB gene. However, in this case the 9E10 
monoclonal antibody specific for the c-myc peptide tag cannot be used 
in the Western analysis, as the mature HCPB has no C- terminal tag. 
therefore, the primary antibody was an anti-bovine carboxypeptidase A 
raised in rabbit (from Biogenesis) which had previously been shown to 
cross-react with purified human pancreatic carboxypeptidase B. the 
secondary antibody was an anti-rabbit IgG antibody labelled with 
horseradish peroxidase and raised in goat (Sigma A9169 or similar). 

Expression of the cloned mature HCPB in pICI266 (pICI1736) 
was demonstrated in E.coli strains MSD213 and MSD525 by the Coomassie 
stained gels showing an additional protein band at about 34,000 
daltons when compared to vector (pICI266) alone clones. A band of the 
same size gave a signal by Vestern analysis detection using the 
anti-bovine carboxypeptidase A. 

Reference Example 9 

Expression of mature HCPB from COS cells 

A gene encoding preHCPB was generated by PCR from pICI1698 
(Reference example 1). The PCR was set up with template pICI1689 
(lOpg) and ollgos SEQ ID NO 1 and SEQ ID NO 2 (lOOpMoles of each) in 
buffer (lOOyl) containing lOmM Tris-HCl (pH8.3), 50mM KCL, 1.5mH 
HgClj, 0.125mM each of dATP, dCTP, dGTP and dTTP and 2.5u Taq DNA 
poljnnerase (Amplitaq, Perkin-Elmer Cetus). The reaction was overlaid 

SUBSTITUTE SHEET (RULE 26) 



wo 97/07769 



- 49 - 



PCT/GB96/01975 



with mineral oil (lOOpl) and incubated at 94*^0 for 1 rain, 53**C for 1 
rain and 72'»C for 2.5 min for 25 cycles, plus 10 min at 72*»C. The PCR 
product of 985bp was isolated by electrophoresis on a IX agarose 
(Agarose type I, Sigma A-6013) gel followed by excision of the band 
from the gel and isolation of the DNA fragment by use of Geneclean 
(Genedean II kit, Stratech Scientific Ltd. or Bio 101 Inc.). The 
Geneclean kit contains 1) 6M sodium iodide 2) a concentrated solution 
of sodium chloride, Tris and EDTA for making a sodium 
chloride/ethanol/water wash; 3) Glassmilk (TM)- a 1.5 ml vial 
containing 1.25 ml of a suspension of a specially formulated silica 
matrix in water. 

This is a technique for DNA purification based on the method 
of Vogelstein and Gillespie published in Proceedings of the National 
Academy of Sciences USA (1979) Vol 76, p 615. Alternatively any of 
the methods described in "Molecular Cloning - a laboratory manual" 
Second Edition, Sambrook, Fritsch and Maniatis (Cold Spring Harbor 
Laboratory, 1989) can be used. Briefly, the Geneclean procedure is as 
follows. To 1 volume of gel slice is added 3 volumes of sodium iodide 
solution from the kit. The agarose is melted by heating the mix at 
55*»C for 10 min then Glassmilk (5-lOul) is added, mixed well and left 
to stand for 10 min at ambient temperature. The glassmilk is spun 
down and washed 3 times with NEW WASH (SOOpl) from the kit. The wash 
buffer is removed from the Glassmilk which is to dry in air. The DNA 
is eluted by incubating the dried Glassmilk with water (5-10^1) at 
55**C for 5-10 min. The aqueous supernatant containing the eluted DNA 
is recovered by centrifugation. The elution step can be repeated and 
supernatants pooled. 

The preHCPB gene was digested for Ih at 37*»C with EcoRI and 
Hindlll in a lOOpl reaction containing lOOmM Tris-HCl (pH 7.5), lOmM 
magnesium chloride, 50mM NaCl, 0.025% triton X-lOO, and 25u each of 
Hindlll and EcoRI (New England Biolabs). The digested fragment was 
purified by agarose gel electrophoresis and GeneClean as described 
above for the uncut fragment and cloned into pBluescript (Stratagene 
Cloning Systems). 

pBluescript KS+ DNA (Spg) was digested to completion with 
EcoRI and Hindlll (25u each) in a lOOpl reaction as described above. 
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Calf* intestinal alkaline phosphatase (Ipl; New England Biolabs, 
lOu/ul) was the added to the digested plasmid to remove 5' phosphate 
groups and incubation continued at 37 ''C for a further 30 minutes. 
Phosphatase activity vas destroyed by incubation at 70**C for 10 
minutes. The EcoRI-Hindlll cut plasmid vas purified from an agarose 
gel as described above* The EcoRI-Hindlll digested preHCPB gene 
(50ng) vas ligated vith the above cut plasmid DNA in 20pl of a 
solution containing 30mM Tris-Hcl (pH7.8), lOmM MgClj, lOmM DTT, ImM 
ATP, 50 iJg/ml BSA and 400u T4 DNA ligase (Nev England Biolabs, Inc) at 
25^C for 4h. A Ipl aliquot of the reaction vas used to transform 20\xl 
of competent E. coli DH5a cells (MAX efficiency DH5a competent cells, 
Life Technologies Ltd) using the protocol provided vith the cells. 
Transformed cells vere plated onto L-agar plus lOOpg/ml Ampicillin. 
Potential preHCPB clones vere identified by PCR. Each clone vas 
subjected to PCR as described above for preparation of the preHCPB 
gene except that the mix vith the cells vas incubated at 94**C (hot 
start procedure) for 5 min prior to 25 cycles of PCR and oligos SEQ ID 
NOs 3 and 4 replace oligos SEQ ID NOs 1 and 2. A sample (lOpl) of the 
PCR reaction vas analysed by electrophoresis on a IX agarose gel. 
Clones containing the preHCPB gene vere identified by the presence of 
a 1.2kb PCR product. Clones producing the 1.2kb vere used for large 
scale plsamid DNA preparation and the sequence of the insert confirmed 
by DNA sequence analysis. The plasmid containing the preHCPB gene in 
pBluescript vas named pHF15. 

To generate vectors capable of expressing HCPB in eukaryotic 
cells the GS-System^"™^ system (Celltech Biologies) vas used (WO 
87/04462, WO 89/01036, WO 86/05807 and WO 89/10404). The procedure 
requires cloning the preHCPB gene into the Hindlll-EcoRI region of 
vector pEE12 [this vector is similar to pSV2.GS described in 
Bebbington et al. (1992) Bio/Technology 10, 169-175, vith a number of 
restriction sites originally present in pSV2.GS removed by 
site-directed mutagenesis to provide unique sites in the multi-linker 
region]. To construct the expression vector, plasmids pEE12 and pMF15 
vere digested vith EcoRI and Hindlll as described above. The 
appropriate vector (from pEE12) and insert (from pMF15) from each 
digest vere isloated from a 1% agarose gel and ligated together and 
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used to transform competent DH5a cells. The transformed cells were 
were plated onto L agar plus amplcillin (lOOpg/ml). Colonies were 
screened by PCR as described above, with oligos which prime within the 
CHV promoter (SEQ ID NO 5) and in the HCPB gene (SEQ ID NO 6). Clones 
producing a 1.365kb PCR product were used for large scale plasraid DNA 
preparation and the sequence of the insert confirmed by DNA sequence 
analysis. The plasmid containing the preHCPB sequence in pEE12 was 
named pHF48. 

A second eukaryotic expression plasmid, pEE12 containing the 
prepro sequence of preproHCPB was prepared as described above. Oligos 
SEQ ID NOs 7 and 8 were used in the initial PCR to isolate a gene for 
the prepro sequence from pMFlB (described in Reference Example 11). In 
this case the PCR was performed with a hot start procedure by first 
incubating the mix without Taq DNA polymerase for 5 min at 94**C. Taq 
DNA polymerase (2.5u) was then added and the PCR continued through the 
25 cycles as described above. The 360bp fragment was clone into 
pBluescript to give pMF66 and subsequently into pEE12 (screening by 
PCR with SEQ ID NOS 7 and 8) to give pMr67. 

For expression in eukaryotic cells, vectors containing genes 
capable of expressing preHCPB and the prepro sequence were 
cotransfected into COS-7 cells. COS cells are an African green monkey 
kidney cell line, CV-1, transformed with an origin-defective SV40 
virus and have been widely used for short-terra transient expression of 
a variety of proteins because of their capacity to replicate circular 
plasraids containing an SV40 origin of replication to very high copy 
number. There are two widely available COS cell clones, COS-1 and 
COS-7. The basic methodology for transfection of COS cells is 
described by Bebbington in Methods: A Companion to Methods in 
Enzyraology (1991) 2, p. 141. For expression of HCPB, the plasmid 
vectors pMF48 and pHF67 (4ug of each) were used to transfect the COS-7 
cells (2 X 10^) in a six-well culture plate in 2ml Dulbecco's Modified 
Eagle's Medium (DMEM) containing 10% heat inactivated foetal calf 
serum (PCS) by a method known as lipofection - cationic lipid-mediated 
delivery of polynucleotides [Feigner et al. in Methods; A Companion to 
Methods in Enzymology (1993) 5, 67-75). The cells were incubated at 
37**C in a COj incubator for 20h. The mix of plasmid DNA in serum-free 
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medium (200m1; OPTI-MEH Reduced Serum Medium; GibcoBRL Cat. No. 31985) 
was mixed gently with LIPOFECTIN reagent (12yl; GibcoBRL Cat. No. 
18292-011) and incubated at ambient temperature for 15min. The cells 
were washed with serum-free medium (2ml; OPTI-MEM). Serum-free medium 
(600m1j OPTI-HEM) was added to the DNA/LIPOFECTIN and the mix overlaid 
onto the cells which were incubated at 37**C for 6h in a COj incubator. 
The DNA containing medium was replaced with normal DMEM containing lOZ 
FCS and the cells incubated as before for 72h. Cell supernatants 
(250m1) were analysed for HCPB activity against Hipp-Arg (5h assay) as 
described in Reference Example 3. COS cell supernatants which had 
been treated with LIPOFECTIN reagent, but without plasmid DNA, 
hydrolysed 1.2% of the substrate, whereas the COS cell supernatants 
transfected with the mix of plasmids expressing preHCPB and prepro 
sequence hydrolysed 61% of the Hipp-Arg substrate. COS cells 
transfected with only the preHCPB plasmid hydrolysed Hipp-Arg at the 
level seen for COS cells which had been treated with LIPOFECTIN 
reagent alone. 

LIPOFECTIN Reagent is a 1:1 (w/w) liposome formulation of 
the cationic lipid 

N- f 1- (2 , 3-dioleyloxy ) propyl ] -n , n , n- trimethylaramonium chloride (DOTMA) 
and dioleoyl phosphatidylethanolamine (DOPE) in membrane filtered 
water. It binds sponaneously with DNA to form a lipid-DNA complex - 
see Feigner et al. in Proc. Natl. Acad. Sci. USA (1987) 84, 7431. 

Reference Example 10 

Expression of proHCPB from E. coli 

The method of cloning and expressing the pro-HCPB in E.coli 
was very similar to the method described in Reference Example 7 . 
Again pICI266 was used as the cloning vector, and the starting 
material for PCR of the pro-HCPB gene was plasmid pICI1698 (as 
described in Reference Example 6 ). Two oligonucleotides, known as 
2310 and 2265 (SEQ ID NOs: 52 and 49) were used in the PCR reactions 
(instead of primers FSPTSl and 6HIS9E10R1BS1) , using similar 
conditions to Reference Example 7 . 
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The first, top strand, oligonucleotide, 2310, vas designed 
to prime on pICI1698, and to add the Ncol restriction enzyme site from 
the PelB leader sequence (DNA bases 51 to 66 inclusive in SEQ ID NO: 
46) to the start of the inserted pro-HCPB gene (DNA bases 40 to 57 
inclusive in SEQ ID NO: 38)- The second, bottom strand, 
oligonucleotide, 2265, vas designed to prime at the end of the mature 
HCPB gene, prior to the start of the (His)g-c-myc tag sequence 
(complementary to DNA bases 965 to 987 inclusive in SEQ ID NO: 46), 
and to introduce translation termination codons (complementary to TAA 
TAA) at the end of the gene followed by an EcoRI (GAATTC) restriction 
enzyme site and fill-in bases. This oligo primes back into the gene 
in the PCR to isolate the pro-gene sequence. 

An aliquot of the PCR product was analysed for DNA of the 
correct size (about 1240 base pairs) by agarose gel electrophoresis 
and found to contain predominantly a band of the correct size. The 
remainder of the product from the reaction mix was purified in a 
similar manner to Reference Example 7 . The isolated DNA was 
restriction digested with enzymes Ncol and EcoRI, and a band of the 
correct size (about 1210 base pairs) purified in a similar manner to 
Reference Example 7 . 

pICI266 double stranded DNA, prepared in a similar manner to 
Reference Example 7 . was restriction digested with Ncol and EcoRI 
enzymes, being very careful to ensure complete digestion. DNA of the 
correct size (about 5600 base pairs) was purified in a similar manner 
to Reference Example 7 > 

Aliquots of both restricted and purified DNA samples were 
checked for purity and concentration estimation using agarose gel 
electrophoresis compared with known standards. From these estimates 
ligation mixes were prepared to clone the pro-HCPB gene into the 
pICI266 vector in a similar manner to Reference Example 7 . 

Following the ligation reaction the DNA mixture was used to 
transform E.coli strain DH5a, colonies were picked and tested by 
hybridisation, in a similar manner to Reference Example 7 . 

Four positive hybridisation isolates were checked by PCR for 
inserts of the correct size, using primers 2310 and 2265, (SEQ ID NOs: 
52 and 49), and for priming with a pair of internal primers 1279 (SEQ 
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ID NO: 36) and 679 (SEQ ID NO: 33) In a similar manner to Reference 
Example 1 . The PCR products were analysed for DNA of the correct size 
(about 1200 base pairs from primers 2310 to 2265 » and about 580 base 
pairs from primers 1279 to 679) by agarose gel electrophoresis. All 
clones gave PCR DNA products of the correct size. 

All four of the clones were then taken for plasmid DNA 
preparation, and were then sequenced over the region of PCR product in 
a similar manner to Reference Example 7 . The clones were sequenced 
using six separate oligonucleotide primers known as 1504, 1802, 679, 
1281, 1590 and 1592 (SEQ ID NOs: 42, 45, 33, 37, 53 and 54). From the 
sequencing results a clone containing a plasmid with the required 
pro-HCPB gene sequence was selected, and is known as pICI1738. 

The confirmed sequence of the cloned pro-HCPB gene in 
pICI1738, showing amino acid translation, from the start of the PelB 
sequence to the EcoRI restriction site is shown as SEQ ID NO: 55 with 
DNA numbering starting from 1 in the first codon of PelB, and peptide 
numbering starting from 1 in the mature HCPB. 

To obtain controlled expression of the pro-HCPB the pICI1738 
plasmid DNA was transformed into calcium chloride transformation 
competent E.coli expression strains in a similar manner to Reference 
Example 7 . All pICI1738 transformed expression strains were treated 
in a similar manner to Reference Example 7 to test for expression of 
the cloned HCPB gene. However, in this case the 9E10 monoclonal 
antibody specific for the c-myc peptide tag cannot be used in the 
Western analysis, as the pro-HCPB has no C-terminal tag. Therefore, 
the primary antibody was an anti-bovine carboxypeptidase A raised in 
rabbit (from Biogenesis) which had previously been shown to 
cross-react with purified human pancreatic carboxypeptidase B. The 
secondary antibody was an ant i -rabbit IgG antibody labelled with 
horseradish peroxidase and raised in goat (Sigma A0545 or similar). 

Expression of the cloned pro-HCPB in pICI266 (pICI1738) was 
demonstrated from E.coli by the Coomassie stained gels showing an 
additional protein band at about 40,000 Daltons when compared to 
vector (pICI266) alone clones, and clones producing the tagged HCPB 
( Reference Example 7 ). A band of the same size gave a signal by 
Western analysis detection using the anti-bovine carboxypeptidase A. 
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Reference Example 11 

Expression of proHCPB from COS cells 

A gene for preproHCPB vas prepared by PCR as described in 
Reference Example 9 using as template pICI1689 and oligos SEQ ID NOS 1 
and 7 to give a 1270bp PCR product. The gene vas digested with EcoRI 
and Hindlll and cloned initially into pBluescript KS-i- (to give pHFlS) 
then into pEE12 in DH5a (to give pMF49) as described in Reference 
Example 9. Flasmid pHF49 was trans fected into COS- 7 cells by use of 
LIPOFECTIN reagent as described in Reference Example 9 and cell 
supernatants (250vil) assayed for HCPB activity against Hipp-Arg (5h 
assay), as described in Reference Example 3, following activation with 
trypsin (700Mg/ml) in 50mM Tris-Hcl (pH7.6), 150mM NaCl at 4**C for Ih. 
Under these condition, complete hydrolysis of the Hipp-Arg substrate 
was achieved, whereas supernatant from COS cells which had been 
treated with LIPOFECTIN reagent alone (without plasmid DNA) when 
activated with trypsin hydrolysed 30% of the Hipp-Arg substrate. 

Reference Example 12 
Purification of native HCPB 

A system has been determined for the initial purification of 
the native and the different mutant enzjnnes via two routes. 

The preferred route is described first. Recombinant E.coli 
cell paste containing the recombinant enzyme was taken from storage at 
-70*'C and allowed to thaw. The weight of cell paste was measured in 
grams and the paste resuspended with the addition of buffer A ( 200mH 
Tris (hydroxymethyl)aminome thane hydrochloride (TRIS-HCl), 20X 
sucrose, pH 8.0 ) to a volume equal to the initial weight of the cell 
paste. The cell suspension was incubated at room temperature for 20 
minutes with occasional gentle mixing before an equal voliune of 
distilled water was added and thoroughly mixed in. The cell suspension 
was again incubated at room temperature for 20 minutes with occasional 
gentle mixing. The resulting crude osmotic shockate was clarified by 
centrifugation at 98000 x g for 90 minutes at 4*»C after which the 
supernatant was decanted off from the pelleted insoluble fraction. 
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Deoxyribonuclease 1 was added to the supernatant to a final 
concentration of 0«1 mg/ml The mixture vas incubated at room 
temperature, with continuous shaking, until the vicosity vas reduced 
enough for it to be loaded on to a Carboxypeptidase Inhibitor CNBr 
activated Sepharose affinity column, prepared according to instructions 
vith the CNBr activated Sepharose AB from Pharmacia and 
carbox3rpeptidase inhibitor from potato tuber (c-0279, Sigma) . The 
supernatant vas adjusted to pHS.O and loaded on to the affinity 
column, pre-equilibrated vith lOmM TRIS-HCl, 500mM sodium chloride, pH 
8.0. After loading the supernatant the column vas vashed until the 
absorbance of the flov through vas back to baseline before the bound 
material vas eluted from the column by elution buffer (lOOmN sodium 
carbonate, 500mM sodium chloride, pH 11.4). The eluted fractions vere 
frozen at -20**C vhilst those containing the recombinant 
carboxypeptidase vere determined by Western blot analysis using an 
anti- c-myc tag antibody ( 9E10), folloved by an anti-mouse -horse 
raddish peroxidase conjugate (a-9044, sigma) that gave a colour 
reaction vith exposure to 4-chloro-naphthol and hydrogen peroxide. 

Fractions containing the recombinant carboxypeptidase B vere 
pooled, concentrated and the pH adjusted to pH 7.5 before being 
snap-frozen and stored at -20**C. Further purification of the pooled 
sample, utilising knovn methods such as ion exchange and gel 
permeation chromatography may performed if required. 

The second route involves the total lysis of the E.coli 
cells as opposed to a periplasmic shock, as used in the preferred 
route. 

Recombinant E.coli cell paste containing the recombinant 
enzyme vas taken and resuspended in lysis buffer (50mH TRIS-HCl, 15% 
Sucrose, pH 8.0). Lysozyme vas added to a concentration of Img/ml and 
at the same time lithium dodecyl sulphate (LDS) vas added (SOpl of a 
257. solution per 25ml of suspension) • The suspension vas incubated on 
ice for 30minutes vith occasional shaking, folloved by the addition 
deoxyribonuclease 1 to a concentration of Img/ml and again the 
suspension vas incubated on ice for 30 minutes vith occasion shaking. 
The suspension vas subsequently divided in to 200ml volumes and 
sonicated to complete the disruption of the cells for 10 x 30 sec 
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bursts with 30sec intervals between bursts. Sonicated suspensions 
were centrifuged at 98,000x g for 90 minutes at 4^*0 after which the 
supernatant was decanted off from the pelleted insoluble fraction. 
The supernatant was adjusted to pH 8.0 and loaded on to the affinity 
column, pre-equilibrated with lOmM TRIS-HCl, 500mM sodium chloride, pH 
8.0. After loading the supernatant the column was washed until the 
absorbance of the flow through was back to baseline before the bound 
material was eluted from the column by elution buffer (lOOmM .sodium 
carbonate, SOOmH sodium chloride, pH 11.4). The eluted fractions were 
frozen at -20oC whilst those containing the recombinant 
carbox3rpeptidase were determined by western blot analysis using an 
anti- c-myc tag antibody (9E10), followed by an anti-mouse -horse 
raddish peroxidase conjugate (a-9044, sigma) that gave a colour 
reaction with exposure to 4-chloronaphthol and hydrogen peroxide. 
Fractions containing the recombinant carboxypeptidase B were pooled, 
concentrated and the pH adjusted to pH 7.5 before being snap-frozen 
and stored at -20*^0. Further purification of the pooled sample, 
utilising known methods such as ion exchange and gel permeation 
chromatography may performed if required. 

Samples of the pooled material from both routes, analysed by 
SDS-PAGE and Coomassie stained nitrocellulose blot provided Coomassie 
stained bands at the correct molecular weight for the recombinant 
carboxypeptidase B's. These bands sequenced by an automated 
protein/peptide sequencer using the Edman degradation technique gave 
positive matches for the particular recombinant carboxypeptidase D 
being purified. 

Reference Example 13 

Expression of murine A5B7 F(ab')2-HCPB fusion protein from COS cells 

A particular antibody capable of binding with a tumour 
associated antigen is mouse monoclonal antibody A5B7. Antibody A5B7 
binds to human carcinoembryonic antigen (CEA) and is particularly 
suitable for targeting colorectal carcinoma. A5B7 is available from 
DAKO Ltd., 16 Manor Courtyard, Hughenden Avenue, High Wycombe, Bucks 
HP13 5RE, England, United Kingdom. Antibody fragments can be prepared 
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from whole IgG antibody by conventional means such as for example 
F(ab')2 fragments as described by Mariani, M. et al (1991), Molecular 
Immunology 28, 69-77. In general the antibody (or antibody 
fragment) - enzyme conjugate should be at least divalent, that is to 
say capable of binding to at least 2 tumour associated antigens (which 
may be the same or different). Antibody molecules may be humanised by 
known methods such as for example by *'CDR grafting** as disclosed in 
EP239400 or by grafting complete variable regions onto human constant 
regions as disclosed in US 4816567. Humanised antibodies may be 
useful for reducing immunogenicity of an antibody (or antibody 
fragment). A humanised version of antibody A5B7 has been disclosed in 
PCT W092/01059. 

The hybridoma which produces monoclonal antibody A5B7 was 
deposited with the European Collection of Animal Cell Cultures, 
Division of Biologies, PHLS Centre for Applied Microbiology and 
Research, Porton Down, Salisbury, Wiltshire SPA OJG, United Kingdom. 
The date of deposit was 14th July 1993 and the accession number is No. 
93071411. Antibody A5B7 may be obtained from the deposited hybridoma 
using standard techniques known in the art such as documented in Fenge 
C, Fraune E & Schuegerl K in ••Production of Biologicals from Animal 
Cells in Culture- (Spier RE, Griffiths JR & Meignier B, eds) 
Butterworth-Heinemann, 1991, 262-265 and Anderson BL & Gruenberg ML in 
"Commercial Production of Monoclonal Antibodies^^ (Seaver S, ed). 
Marcel Dekker, 1987, 175-195. The cells may require re-cloning from 
time to time by limiting dilution in order to maintain good levels of 
antibody production. 

This example describes the preparation of cDNA from the A5B7 
hybridoma, the isolation of specific Fd and light chain fragments by 
PCR, determination of the complete DNA sequence of these fragments, 
the subsequent preparation of an Fd-HCPB fusion gene and a 
co-expression vector capable of producing both light chain and Fd-HCPB 
fusion protein in eukaryotic cells, expression of the F(ab02"KCPB 
from COS cells by co-trans fee tion with a prepro sequence from HCPB. 
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a) Preparation of mRNA from hvbrldoma cells 

There are several procedures for the isolation of polyA+ 
mBNA from eukaryotic cells (Sambrook J,, Fritsch Maniatis T,, 

Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory 
Press, Second Edition, 1989, Chapter 8 p3 hereinafter referred to as 
Maniatis). One such method is provided in kit form by Pharmacia and 
relies on the lysis of a relatively small number of cells (10^ or 
less) followed by binding of polyA+ mRNA to an oligo dT column. 
Unwanted cell components are removed by washing with a low salt 
concentration before eluting the mRNA in high salt solution at 
elevated temperature. 

mRNA was prepared from 10^ A5B7 hybridoma cells using the 
Quickprep mRNA kit (Pharmacia Biotechnology Ltd.). The concentration 
of the mRNA was estimated by scanning a sample from 300-220nm in a 
Uvikon 930 spectrophotometer (Kontron Instruments) and using an 
extinction coefficient of AOpg/ml at 260nm. The mRNA was stored as 
2.5pg aliqouts precipitated in ethanol. 

b) cDNA synthesis . 

The method used for cDNA synthesis was based on that of 
Gubler and Hofman which relies on reverse transcription from primed 
mRNA followed by RNAse H treatment to provide priming and sjmthesis of 
the second strand by DNA polymerase I. Other methods for the 
S3mthesis of cDNA are reviewed in Maniatis (Chapter 8). 

A 5iJg sample of mRNA was primed with oligo dT (12-18mer 
mixture, Pharmacia Biotechnology Ltd., O.Syg) in a lOyl solution 
containing 2.5u placental RNAse inhibitor (Life Technologies Ltd.) 
made up with RNAse-free water by incubating at 70**C followed by 
cooling on ice. First strand cDNA S3mthesis was then performed by 
adding 4m1 5x H-RT buffer (250mM Tris, pH8.3, 200mM KCl, 30mM MgClj 
and 0.5mg/ml BSA), 2^1 O.IM DTT (dithiothreitol) , lyl dNTP mix 
(dATP,dCTP,dGTP and dTTP at 20mM), Ajil Superscript Reverse 
transcriptase (Life Technologies Ltd.) and incubating at 42**C for 1 
hour. For the second strand reaction, LSpl dNTP mix (as above), 
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92.5yl SNAse-free water, 30m1 5x reaction buffer (125iiiM Tris, pH7.5, 
500mM KCl, 25mM MgC12^ SOmM (^4)280^ and 0.5 mg/ml p-NAD), Ijil T4 DNA 
ligase (lOu, Life Technologies Ltd.), 4m1 DNA polymerase I (40u, Life 
Technologies Ltd.) and Ijil RNAse H (2.7u, Life Technologies Ltd.) were 
added and incubation continued at 16'C for a further 2 hours. To 
ensure that blunt-ended cDNA was prepared a final Incubation at 16»C 
for 5 minutes after adding 2yl T4 DNA polymerase (lOu, Life 
Technologies Ltd. ) was performed. Enzyme activity was then stopped by 
incubation at 70»C for 10 minutes. 



c) Isolation of antibody gene fragments bv PCR 

Isolation of A5B7 Fd and L chain fragments was performed 
using the cDNA as template. The Fd fragment was terminated 
immediately after the hinge sequence (c-terrainal threonine) 
hereinafter referred to as proteolytic type Fd. 

Material from the first-strand cDNA reaction or after 
completion of the second strand reaction is suitable as template. The 
material could be used neat from the completed reaction or as a 
dilution (up to 1 in 100) in double-distilled water. Oligonucleotides 
(SEQ ID numbers 13-19) were used in the generation of the Fd and L 
chain fragments. For each antibody fragment, the 5' region 
oligonucleotide (SEQ ID 13 for Fd fragment and SEQ ID 14 for the L 
chain) encoded a restriction enzyme site (Hindlll for Fd and EcoRI for 
L chain) a consensus Kozak sequence (GCCGCCACC) to maximise 
translation initiation and a portion of the natural murine signal 
sequence. The 3' region oligonucleotide for the proteolytic type Fd 
fragment (SEQ ID 15 was complementary to the 3' end of the antibody 
hinge region, encoded mutations to introduce tandem translation 
termination codons (TAG and TAA) immediately after the hinge and 
contained an EcoRI restriction enzyme site beyond this sequence. The 
3' region of the L chain was determined by an oligonucleotide (SEQ ID 
16) complementary to the end of the coding region, introduced an 
additional translation termination codon (TAA) and an EcoRI 
restriction site. In addition pairs of partially overlapping and 
complementary oligonucleotides for each fragment (SEQ IDS 17 and 18 
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for the Fd and SEQ IDS 19 and 65 for the L chain) were used to 
introduce silent mutations into each DNA strand resulting in the 
removal of a BamHI from the CHI of the Fd fragment and the VL of the L 
chain without altering the encoded amino-acid sequence. Each 5' and 
3' oligonucleotide was used with the appropriate mutagenic 
oligonucleotide to generate 2 mutated fragments of each antibody 
chain. After purification the two fragments were mixed in equal 
proportions and used as the templates for a second PCR reaction using 
the relevant 5' and 3' region oligonucleotides. The products of these 
reactions were the full-length Fd and L chain fragments without 
internal BamHI sites. 

In general, 5\xl of cDNA was added to a lOOul reaction 
containing lOmM Tris-HCl,pH 8-3, 50mM KCl, 0.1% gelatin, 1.5mM MgC12, 
1.25 mM each of dATP, dCTP, dGTP and dTTP, IpM each of an appropriate 
oligo pair and 2.5u Taq DNA polymerase (Amplitaq, Perkin-Elmer Cetus). 
Each reaction was overlaid with IOOmI mineral oil and incubated at 
94^C for 1.5 minutes, 50 or 55'*C for 1.0 minute and 72'»C for 2.0 
minutes for 25 cycles plus 10 minutes at 72*»C. Control reactions with 
no DNA were also set up. 

The PCR reactions were analysed by running a 5\il sample of 
each on a 0.8% agarose (Sigma Chemical Company Ltd.) gel which was 
subsequently stained in lyg/ml Ethidium Bromide (BDH Laboratory 
Supplies) solution and the DNA visualised on a UV transilluminator. 
Bands of the appropriate size were visible in all PCRs with A5B7 cDNA 
present indicating successful amplification of the fragments of the Fd 
and L chains. The absence of a DNA band in the control reactions 
indicated that the reagents used did not contain contaminating DNA. 

Each PCR product was purified by use of a Centricon 100 
filtration microconcentrator (Amicon Ltd.). Each reaction was added 
to a concentrator and the volume increased to 2ml by addition of 
double distilled water. The unit was then centrifuged at 500xg 
(Sorval RT6000B benchtop centrifuge with HIOOOB rotor) for 5 minutes 
and the "flow-through" discarded. The retentate was diluted to 2ml 
again and the unit re-centrifuged. The process was repeated for a 
third time. This procedure results in the removal of excess oligos 
and buffer components from the amplified DNA. These purified DNAs 
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were then used directly in subsequent PCR reactions. The appropriate 
pairs of fragments were mixed in equal proportions and aliquots used 
in the second PCRs with the respective 5' and 3' oligonucleotides. 

d) Subcloning the PCR generated fragments into pBluescript 

The products of the second PCR reactions shoved bands of 
approximately 775bp and 730bp consistent with the full-length Fd and L 
chains respectively. These products were also purified using Centricon 
100 microconcentrators as above. Each DHA product was then 
precipitated in a 1.5ml solution containing SOpl 3H sodium acetate , 
distilled water to 500ul and Iral of absolute ethanol. The solution 
was incubated on ice for at least 10 minutes before centrifugation at 
ll,600xg for 10 minutes (MSE MicroCentaur) . The supernatant was 
discarded and the pellet washed in 1ml 70% ethanol (v/v in distilled 
water) by centrifugation for a further 5 minutes. The supernatant was 
discarded and the DNA pellet dried under vacuum. Each DMA pellet was 
resuspended in distilled water. The Fd PCR product was then digested 
with EcoRI and Hindlll in a 200^1 reaction containing 20mM 
Tris-acetate, pH 7.9, lOmM magnesium acetate, 50mM potassium acetate. 
In* dithiothreitol (DTT), and 25u each of Hindlll and EcoRI (Promega 
Corporation). The L chain product was digested with EcoRI in a 30^1 
reaction containing 90mH Tris-HCl, pH7.5, lOmM magnesium chloride, 
50mM sodium chloride and lOu EcoRI. Digests were incubated at 37*C 
for 1 hr. 

The digested fragments were then purified by electrophoresis 
on a 0.75X SeaPlaque GTG agarose gel (FMC BioProducts Ltd) followed by 
excision of the appropriate bands from the gel. The agarose gel slice 
was redissolved by incubation at 65**C for 2 minutes, diluted to a 
final volume of 450ij1 with distilled water and SOyl 3M sodium acetate 
added. This solution was extracted with an equal volume of liquified 
phenol, equilibrated with Tris buffer pH7.6 (Fisons Scientific 
Equipment) using centrifugation at ll,600xg for 2 minutes (HSE 
HicroCentaur) to separate the aqueous and phenolic phases. The 
subsequent aqueous phase was re-extracted with a phenol: chloroform 
mixture (50:50 v:v) and again with chloroform prior to ethanol 
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precipitation as described above. Each purified pellet was 
resuspended in lOyl distilled water and a lyl sample visualised by 
electrophoresis on a 0.8X agarose gel to estimate quality and 
concentration. 

pBluescript (Stratagene Cloning Systems) was used for 
initial cloning of Fd and L chain cDNAs. This phagemid vector has 
unique EcoRI and Hindlll cloning sites, Ampicillin resistance gene, 
and both ColEI and fl replication origins for isolation of either 
double- or single stranded DNA. Syg pBluescript KS- DNA was digested 
to completion with 30u EcoRI (Promega Corporation) in a lOOpl reaction 
containing 90mM Tris-HCl, pH7.5, lOmM MgC12, 50mM NaCl or with EcoRI 
and Hindlll in a lOOyl reaction containing 20mM Tris-acetate, pH 7.9, 
lOraM magnesium acetate, 50mM potassium acetate, ImM dithiothreitol 
(DTT), and 25u each of EcoRI and Hindlll (Promega Corporation) at 37«C 
for 1 hour. 2ul calf- intestinal alkaline phosphatase (2u, Bohringer 
Mannheim) was the added to the EcoRI digested plasmid to remove 5' 
phosphate groups and incubation continued at 37 "C for a further 30 
minutes. Phosphatase activity was destroyed by incubation at 70»C for 
10 minutes. The EcoRI-Hindlll cut plasmid was purified from a 
SeaPlaque GIG agarose gel as described above. 

25 - 50ng of digested Fd or L chain PCR product was ligated 
with 50ng of EcoRI-Hindll or EcoRI/CIP treated pBluescript 
respectively in lOpl of a solution containing 30mM Tris-HCl, pH7.8, 
lOmM MgClj, lOmH DTT, ImM ATP and 1.5u T4 DNA ligase (Promega 
Corporation) at 16»C for 2.5 hours. A Ipl aliquot of each reaction 
was used to transform 20m1 of competent E.coli DH5a cells (Life 
Technologies Ltd.) using the protocol provided with the cells. 
Transformed cells were plated onto L-agar plus lOOyg/ml Ampicillin, 
ImM IPTG and 0.2% X-gal and incubated overnight at 37»C. Clones 
containing cloned inserts were selected on the basis of producing 
white colonies on the above medium compared to the blue colour 
generated by cells containing the parental plasmid. 
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e) DNA sequence analysis of cDNA clones 

The potential Fd and L chain cDNA clones identified by 
colour selection vere picked from the agar plates and used for large 
scale plasmid DNA preparation. Each clone was used to inoculate 200ml 
of L-broth plus lOOyg/ml ampicillin in a 500ml conical flask* The 
cultures were incubated, shaking at 37**C overnight. After growth the 
cells from each culture were pelleted by centrifugation at SOOOxg for 
10 minutes in a Sorvall RC5C centrifuge and GS3 rotor at 4**C. The 
cell pellet from each culture was resuspended in 20ml TE buffer and 
re-centrifuged at 2000xg for 10 minutes in a Sorvall RC5C centrifuge 
and SS-34 rotor in an oak-ridge tube at 4''C. Each washed cell pellet 
was resuspended in 3ml ice cold 25% sucrose, 50mH Tris, pH8.0, and 
left on ice. Fresh lysozjmie solution (1.0ml at lOmg/ml) was added, 
the contents mixed by rolling the tube and incubation on ice continued 
for 5 minutes. Sodium ethylene diamine tetracetate (EDTA) solution 
(1.0ml at O.SmM, pH8.5) was added and the contents gently mixed. 
Finally, 5.0ml of iced Triton X solution (0.1% Triton X-100, 62.5mM 
EDTA, 50mN Tris, pH8.0) was added, the contents gently mixed and 
incubation on ice continued for a further 10 minutes. The cell debris 
was then pelleted by centrifugation at 39,000xg for 30 minutes in a 
Sorvall RC5C centrifuge and SS-34 rotor at 4**C. The supernatant 
containing plasmid DNA was added to 16g caesium chloride (Boehringer 
Mannheim) and ISOpl ethidium bromide solution (lOmg/ml) and the volume 
increased to 18.5ml by addition of TE buffer. This solution was 
transferred to an 18.5ml crimp top, polypropylene centrifuge tube 
(Sorvall Instruments). The tube was sealed and centrifuged at 
180,000xg for 16 hours in a Sorvall TV865B (titanium, vertical) rotor 
and 0TD65B centrifuge at 18**C. 

After centrifugation, plasmid DNA was visible as a distinct 
orange band in the CsCl/EtBR density gradient which had formed. The 
plasmid DNA was removed from the gradient using a hypodermic syringe 
to pierce the tube wall. The sample taken from the gradient was 
diluted 3-4 fold with TE buffer and the DNA precipitated by addition 
of an equal volume of isopropyl alcohol and incubation on ice for 10 
minutes. The precipitated DNA was pelleted by centrifugation at 
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17,000xg in a Sorvall RC5C centrifuge and SS-34 rotor at 4»C and the 
supernatant discarded. The resulting pellet was washed in 70X ethanol 
(v/v) and re-centrifuged for 5 minutes. The pellet was then dried 
under vacuum, resuspended in 1.8ml TE buffer and 200m1 3M sodium 
acetate solution and extracted with an equal volume of phenol using 
centrifugation at 17,000xg for 2 minutes to separate the phases. The 
aqueous phase was re-extracted against an equal volume of chloroform 
before precipitating the DNA by addition of an equal volume of ethanol 
at -20»C and incubating on ice for 10 minutes. The purified DNA was 
pelleted as above, washed in 5ml 70% ethanol and the pellet vacuum 
dried. The dried pellet was resuspended in SOOul double-distilled 
water and DNA concentration estimated by scanning a diluted sample 
from 300 to 220nm in a UV spectrophotometer using and extinction 
coefficient of 50Mg/ml/OD260. A number of proprietary kits, e.g. 
Qiagen (Hybaid Ltd), are also available for plasmid DNA purification. 

This purified plasmid DNA was then used for DNA sequence 
analysis. Double stranded DNA can be used for DNA sequence analysis by 
the dideoxy chain termination method of Sanger (Proc. Nat. Acad. Sci. USA 
74, 1977, p5463) using a proprietary sequencing kit such as the 
Sequenase kit supplied by United States Biochemical Company and used 
in accordance with the protocols provided. 

Aliquots (2-4vig) of Fd and L chain cDNA clone plasmid DNA 
were used for DNA sequence analysis. Each aliquot was initially 
denatured by incubation with 0.2M NaOH, 0.2mM EDTA in a final volume 
of lOOul at room temperature for 10 minutes. The denatured DNA was 
then precipitated by addition of lOyl 3M sodium acetate (pH5.0) and 
275m1 ethanol and incubation on ice for 10 minutes. The precipitated 
DNA was recovered as described for plasmid DNA above. The denatured 
DNA was then primed for sequencing by incubation of each with 
O.Spmoles of an appropriate primer in lOpl of Sequenase reaction 
buffer (40mH Tris, pH7.5, 25mM MgCl2, 50raM NaCl) containing lOX 
di-methyl sulphoxide (DMSO) at 65»C for 2 minutes followed by gradual 
cooling to below 30»C. These primed templates were then used in 
sequencing reactions according to the protocols provided with 102 DMSO 
added to labelling and termination mixtures. 
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The sequencing reactions vere analysed by autoradiography 
after high resolution electrophoresis on a 6% polyacrylamide: 8H urea 
denaturing gel (Sanger and Coulson, 1978, FEES lett.87, pl07). 

The complete Fd and L chain sequences of the cloned cDNAs 
are given below (SEQ ID NO 20 for the proteolytic type Fd chain and 
SEQ ID NO 22 for L chain). The plasmid containing the proteolytic type 
Fd was named pAFl and the L chain pAF3. The presence of the silent 
mutation in each fragment for removal of the BamHI site was also 
confirmed. The DNA sequence indicates that the antibody is an IgGlic 
Isotype when compared to published constant region DNA sequence data 
(in Kabat, E.A., Wu, T.T., Bilofsky, Reid-Milner, H., Perry, H., 
1987, Sequences of Proteins of Immunological Interest, Fourth Edition, 
Public Health Service N.I.H. Washington DC). 

f) Preparation of Fd-HCPB fusion DNA sequence 

A gene encoding the C- terminal region of the Fd sequence, 
from the Ncol site in SEQ ID NO 20 (position 497) was joined to the 
HCPB sequence by PCR. In this process DNA for an 8 amino-acid linker 
sequence (VPEVSSVF; SEQ ID NO: 67) was introduced. Plasmid pAFl was 
subjected to PCR (hot start procedure) as described in Reference 
Example 9 with oligos SEQ ID NOS 9 and 10 to give a 338bp product. 
Similarly, pICI1698 was subjected to PCR with oligos SEQ ID NOS 11 and 
1 to give a 998bp product. Both products were isolated by agarose gel 
electrophoresis and Geneclean as described in Reference Example 9 and 
used (0.2ng each in SOpl total volume) in a second hot start PCR with 
10 cycles for 1 mln at 94**C and 4 rain at 63**C followed by 2 min at 
94**C. Flanking oligos (SEQ ID NOS 9 and 1; lOOpM each) were added in 
50yl buffer with Amplitaq (2.5u). After heating to 94'»C for 3 min, 
the mix was subjected to 25 cycles of 1.5 min at 94**C, 2 min at SS^'C 
and 2 min at 72**C followed by 10 min at 72**C. The product was a band 
at 1336bp, isolated as described previously, then cut with EcoRI and 
Hindlll and cloned into pBluescript in DH5a (clones were screened by 
PCR with oligos SEQ ID NOS 3 and 4) to give pMF35 To make the 
complete Fd-HCPB fusion sequence, plasmids pAFl and pMF35 were cut 
(lOpg of each) with Ncol and EcoRI for 2h in buffer (100^1) containing 
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SOniM potassium acetate, 20mM Tris-acetate (pH 7.9), lOmM MgCl2, ImM 
DTT, EcoRI (40u) and Ncol (20u). The vector fragment (3.4kb) from 
pAFl was isolated and treated with calf intestinal alkaline 
phosphatase as described in Reference Example 9 and ligated to the 
purified 1.2kb fragment from pMFSS. The resulting vector was cloned 
in DH5o (screened by PCR with oligos SEQ ID NOS 3 and 4 for a l,922bp 
insert) and named pHF39. The EcoRI-Hindlll fragment from pMFSg'was 
cloned into pEE6 (this is a derivative of pEE6.hCMV - Stephens and 
Cockett (1989) Nucleic Acids Research 17, 7110 - in which a Hindlll 
site upstream of the hCMV promoter has been converted to a Bglll site] 
in DH5a (screened by PCR with oligos SEQ ID NOS 5 and 6 for a 2,200bp, 
approximately, insert) to give pMF43. 

Plasmid pAF3 (described above in e) and pEE12 fthis vector 
is similar to pSV2.GS described in Bebbington et al. (1992) 
Bio/Technology 10, 169-175, with a number of restriction sites 
originally present in pSV2.GS removed by site-directed mutagenesis to 
provide unique sites in the multi-linker region] . The appropriate 
vector and insert fragments from each digest were then isolated from 
Seaplaque GTG agarose and ligated together and used to transform 
competent DH5a cells also as described earlier. The transformed 
cells were plated onto L agar plus lOOyg/ml ampicillin. Screening of 
colonies from the transformation was by a PCR method. Colonies were 
transfered into 200ul distilled water and mixed by vortexing. The 
suspended cells were then heated to lOO'C for 1 minute and centrifuged 
at ll,600xg for 2 minutes prior to using the supernatant in a PCR 
reaction. In each PCR reaction, an oligo which primes within the CMV 
promoter (SEQ ID 5) was used with the oligo complementary to the 3' 
region of the light chain (SEQ ID 16) as appropriate. Only clones 
with the antibody fragment gene inserted in expressing orientation 
downstream from the CMV promoter will produce specific PCR products of 
approximately 2.0kb. The resulting plasmid was named pAF6. To make 
the co-expression vector, pMF43 (lOpg) was cut with Bglll (20u) and 
Sail (40U) in buffer (lOOpl) containing lOmM Tris-HCl (pH 7.9), 150mM 
NaCl, lOmH MgClj, ImM DTT and BSA (lOOpg/ml) and the 4348bp fragment 
isolated by agarose gel electrophoresis and purified with Geneclean as 
described previously. Similarly, pAF6 was cut with BaraHI (40u) and 
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Sail (40u) and the 7.8kb vector fragment isolated and ligated to the 
Bglll-Sall fragment from pMF43 and cloned into DH5a. Colonies were 
screened by PCR vith 2 sets of oligos (SEQ ID NOS 14 and 12, and SEQ 
ID NOS 13 and 6). Clones giving PCR products of 360bp and 1.3kp 
respectively were characterised by DNA sequencing. A clone vith 
correct sequence was named pHF53 - light chain/Fd-HCFB co-expression 
vector in DH5a. 

g) Expression of A5B7 F(abM2*HCPB in COS cells 

The procedure described in Reference Example 9 for 
co-transfection of COS-7 cells with the plasmid encoding the prepro 
sequence (pMF67) was repeated with pMF48 replaced by pMF53. COS cell 
supernatants were examined for HCPB activity as described in Reference 
Examples 3 and 9. COS cell supernatants which had been treated with 
LIPOFECTIN reagent, but without plasmid DNA, hydrolysed of the 

substrate, whereas the COS cell supernatants transfected with the mix 
of plasmids expressing light chain/Fd-HCPB and prepro sequence 
hydrolysed 34% of the Hipp-Arg substrate. COS cells transfected with 
only pHF53 plasmid hydrolysed Hipp-Arg at the level seen for COS cells 
which had been treated with LIPOFECTIN reagent alone. By Western 
analysis (see h below) bands of approximatey 80kDa and 160kDa were 
visible, corresponding to Fab'-HCPB and F(ab' )2-(HCPB)2 respectively. 
In a CEA ELISA assay (see i and j below) cell supernatants (see above) 
were used to detect the presence of CEA binding material according to 
the protocol given in j. 

h) Western blot analysis 

Western blot analysis was performed as described as follows: 

Aliquots (20m1) of each supernatant sample were mixed with 
an equal volume of sample buffer (62.5mH Tris, pH6.8, 1% SDS, 10% 
sucrose and 0.05% bromophenol blue) with and without reductant. The 
samples were incubated at 65 ''C for 10 minutes before electrophoresis 
on a 8-18% acrylamide gradient gel (Excel gel system from Pharmacia 
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Biotechnology Products) in a Multiphor II apparatus (LKB Produkter AB) 
according to the manufacturer's instructions. After electrophoresis, 
the separated proteins were transfered to a Hybond C-Super membrane 
(Amersham International) using a Novablot apparatus (LKB Produkter AB) 
according to protocols provided by the manufacturer. After blotting, 
the membrane was air dried. 

The presence of antibody fragments vas detected by the use 
of an anti-murine F(ab')2 antibody-peroxidase conjugate (ICN 
Biomedicals, product no. 67-430-1). The presence of murine A5B7 
antibody fragments was visualised using the ECL detection system 
(Amersham International) according to the protocol provided. 

i) ELISA analysis 

Standard procedures for ELISA assay are available in 
"Laboratory Techniques in Biochemistry and Molecular Biology" eds. 
Burdon, R.H. and van Kippenberg, P.H., volume 15, "Practice and Theory 
of Enzyme Immunoassays", Tijssen, P., 1985, Elsevier Science 
Publishers B.V.. Another source of information is "Antibodies - A 
Laboratory Manual" Harlow, E. and Lane, D.P. 1988, published by Cold 
Spring Harbor Laboratory. 

j) ANTI-CEA ELISA 

1. Prepare coating buffer (1 capsule of Carbonate-Bicarbonate buffer - 
Sigma C-3041 - in 100ml double distilled water). 

2. Add 5\il of CEA stock solution (Img/ml, Dako) to 10ml of coating 
buffer for each 96 well plate required. 

3. Add lOOpl of diluted CEA to each well of a Nunc "Maxisorp" 
microti tre plate - 50ng/well/100Ml. 

4. Incubate plates at 4**C overnight (or room temp, for 2 hours). 
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5* yash plates 4 times for 5 minutes each with Phosphate buffered 
saline + 0.01% Sodium azide (PBSA) + 0.05X Tween 20. 

6. Block plates (after banging dry) with IZ BSA (Sigma A-7888) in PBSA 
containing 0.05X Tween 20 at 200ul per well. Incubate at room temp, 
for 2 hours. 

7. Wash plates A times for 5 minutes each with PBSA containing 0.05X 
Tween 20. 

8. Load samples (culture supernatants) and standards (doubling 
dilutions of proteolytic A5B7 F(ab')2) appropriate. Dilute samples 
in growth medium (or PBS). Include PBSA + IX BSA and diluent as 
blanks . 

9. Incubate at ambient temperature for 3h. 

10. Wash plates 6 times for 5 minutes each with PBSA +0.5% Tween 20. 

11. Prepare secondary antibody solution (anti-mouse IgG F(ab')2, from 
goat, peroxidase conjugated - ICN 67-430-1 - at 20iil in 40ml PBSA + 11 
BSA +0.5% Tween 20) and add lOOpl per well. 

12. Incubate at room temp, for Ih. 

13. Wash plates 6 times for 5 minutes each with PBSA +0.5% Tween 20. 

14. Prepare developing solution by dissolving 1 capsule of 
Phosphate-Citrate Perborate buffer (Sigma P-4922) in 100ml double 
distilled water. Add 30mg o-Phenylenediamine Dihydrochloride (OPD, 
Sigma P-8287) per 100ml buffer. Add ISOjil per well. 

15. Incubate at room temp, in darkness for 15 minutes. 

16. Stop reaction by addition of 50vil per well of 2M Sulphuric acid. 
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17 « Read OD 490nm in plate reader. 
Example 1 

Cloning and expression of D253K HCPB-CHisjg-c-Hyc from E. coli 

The method of cloning and expressing the D253K-HCPB in 
E.coli was very similar to the method described in Reference Example 
7. Again pIC1266 was used as the cloning vector, and the starting 
material for PCR of the pro-HCPB gene was plasmid pICI1698 (as 
described in Reference Example 6 ). However, in this case site 
directed mutagenesis was used during the PCR amplification of the gene 
to change the codon at amino acid position 253 in the mature gene from 
Aspartate to Lysine (GAC to AAA), the D253K change- Two PCR mixtures 
were prepared, in a manner similar to that described in Reference 
Example 7 . In the first reaction primers were FSPTSl (SEQ ID NO: 40) 
and 1398 (SEQ ID NO: 57). In the second reaction primers were 
6HIS9E10R1BS1 (SEQ ID NO: 41) and 1397 (SEQ ID NO: 58). In both 
reactions the starting DNA was pICI1698. Primers 1398 and 1397 (SEQ 
ID NOs: 57 and 58) are designed to anneal around amino acid codon 253, 
introduce the GAC to AAA change in the DNA sequence, and produce 
complementary sequence at the ends of the two PCR products. The other 
two primers, FSPTSl and 6HIS9E10R1BS1 (SEQ ID NOs: 40 and 41) are 
described in Reference Example 7 . Aliquots of the two PCR reactions 
were analysed for DNA of the correct size (about 750 and 250 base 
pairs) and estimation of concentration by Agarose gel electrophoresis, 
and found to contain predominantly bands of the correct size. Another 
PCR was then set up using approximately 4ng of each of the first two 
PCR products, in the presence of dNTPs to a final concentration of 
200uM, Taq poljnnerase reaction buffer, 2U of Taq polymerase in a final 
volume of 80|al. The mixture was heated at 94*^0 for 10 minutes prior 
to the addition of the Taq enzyme, and PCR incubation was carried out 
using 10 cycles of 94**C for 1 minute and 63**C for 4 minutes. On 
completion of these cycles the reaction mix was made up to 120^1 by 
the addition of 120pmols of each end primer, FSPTSl and 6HIS9E10R1BS1 
(SEQ ID NOs: 40 and 41), additional dNTPs (approximately an extra 
lOOpM), Taq polymerase reaction buffer, and 4U of Taq poljmierase. The 
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mixture was heated at 94 ^'C for 10 minutes prior to addition of Taq 
enzyme, and the FCR Incubation vas carried out using 30 cycles of 94**C 
for 1.5 minutes, 50 ""C for 2 minutes, and 72"* C for 2 minutes, followed 
by a single incubation of 72 ''C for 9.9 minutes at the end of the 
reaction. 

An aliquot of the FCR product vas analysed for DNA of the 
correct size (about 1000 base pairs) by agarose gel electrophoresis 
and found to contain predominantly a band of the correct size. The 
remainder of the product from the reaction mix vas purified in a 
similar manner to Reference Example 7 . The isolated DNA vas 
restriction digested vith enzymes Fspl and EcoRI, and a band of the 
correct size (about 1000 base pairs) purified in a similar manner to 
Reference Example 7 . 

plCI266 double stranded DNA, prepared in a similar manner to 
Reference Example 7 , vas restriction digested vith Kpnl enzjrme, and 
blunt-end treated vith T4 DNA polymerase being very careful to ensure 
complete digestion. The purified DNA vas then digested vith 
restriction enzsnme EcoRI. DNA of the correct size (about 5600 base 
pairs) vas purified in a similar manner to Reference Example 7 . 

Aliquots of both restricted and purified DNA samples vere 
checked for purity and concentration estimation using agarose gel 
electrophoresis compared vith knovn standards. From these estimates 
ligation mixes vere prepared to clone the HCFB gene into the pICI266 
vector in a similar manner to Reference Example 7 . 

Folloving the ligation reaction the DNA mixture vas used to 
transform E.coli strain DH5a, colonies vere picked and tested by 
hybridisation, in a similar manner to Reference Example 7 . 

Six positive hybridisation isolates vere checked by FCR for 
inserts of the correct size, using primers FSFITSI and 6HIS9E10R1BS1 
(SEQ ID NOs: 40 and 41), and for priming vith an internal primer 
FSFTSl (SEQ ID NO: 40) and 679 (SEQ ID NO: 33) in a similar manner to 
Reference Example 7 . The FCR products vere analysed for DNA of the 
correct size (about 1000 base pairs from primers FSFTSl to 
6HIS9E10R1BS1, and about 430 base pairs from primers FSFTSl to 679) by 
agarose gel electrophoresis. All clones gave FCR DNA products of the 
correct size. 
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All six of the clones were then taken for plasmid DNA 
preparation, and two were sequenced over the region of PCR product In 
a similar manner to Reference Example 7 . The clones were sequenced 
using eight separate oligonucleotide primers known as 1281, 677, 1504, 
679, 1802, 1590, 1280 and 1731 (SEQ ID NOs: 37, 34, 42, 33, 45, 43, 35 
and 44). From the sequencing results a clone containing a plasmid 
with the required D253K-HCPB gene sequence was selected, and is known 
as PICI1713. 

The confirmed sequence of the cloned D253K-HCPB gene in 
PICI1713, showing amino acid translation, from the start of the PelB 
sequence to the EcoRI restriction site is shown as SEQ ID NO: 59 with 
DNA numbering starting from 1 in the first codon of PelB, and peptide 
numbering starting from I in the mature HCPB. 

To obtain controlled expression of the D253K-HCPB, the 
pICI1713 plasmid DNA was transformed into calcium chloride 
transformation competent E.coli expression strains in a similar manner 
to Reference Example 7. All pICI1713 transformed expression strains 
were treated in a similar manner to Reference Example 7 to test for 
expression of the cloned D253K-HCPB gene. In this case the 9E10 
monoclonal antibody specific for the C-myc peptide tag was used in the 
Western analysis, as the D253K-HCPB has the C- terminal (His)g-c-rayc 
tag in a similar manner to Reference Example 7 . 

Expression of the cloned tagged D253K-HCPB in pICI266 
(PICI1713) was demonstrated from E.coli by the Coomassie stained gels 
showing a strong protein band at about 35,000 Daltons when compared to 
vector (pICI266) alone clones, and clones producing the tagged HCPB 
(Reference Example 7). A band of the same size gave a strong signal 
by Western analysis detection of the c-myc tag. 

Example 2 

Cloning and expression of D253R HCPB- (His )g-c-Myc from E. coli 

The method of cloning and expressing the D253R-HCPB in 
E.coli was very similar to the method described in Reference 
Example 8. Again pICI266 was used as the cloning vector, and the 
starting material for PCR of the pro-HCPB gene was plasmid pICI1712 
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(as described in Reference Example 7 > However, in this case site 
directed mutagenesis was used during the PCR amplification of the gene 
to change the codon at amino acid position 253 in the mature gene from 
Aspartate to Arginine (GAC to CGC), the D253R change- Two PCR 
mixtures were prepared, in a manner similar to that described in 
Reference Examples 7 and 8 > In the first reaction primers were 2264 
(SEQ ID NO: 48) and 2058 (SEQ ID NO: 61). In the second reaction 
primers were 6HIS9E10R1BS1 (SEQ ID NO: 41) and 2054 (SEQ ID NO: 62). 
In both reactions the starting DNA was pICI1712. 

Primers 2058 and 2054 (SEQ ID NOs: 61 and 62) are designed 
to anneal around amino acid codon 253, introduce the GAC to CGC change 
in the DNA sequence, and produce complementary sequence at the ends of 
the two PCR products. The other two primers, 2264 and 6HIS9E10R1BS1 
(SEQ ID NOs: 48 and 41) are described in Reference Examples 7 and 8 . 
Aliquots of the two PCR reactions were analysed for DNA of the correct 
size (about 750 and 250 base pairs) and estimation of concentration by 
Agarose gel electrophoresis, and found to contain predominantly bands 
of the correct size. Another PCR was then set up using approximately 
4ng of each of the first two PCR products, in the presence of dNTPs to 
a final concentration of 200pM, Taq polymerase reaction buffer, 2U of 
Taq polymerase in a final volume of 80^1. The mixture was heated at 
94^C for 10 minutes prior to the addition of the Taq enzyme, and PCR 
incubation was carried out using 10 cycles of 94*»C for 1 minute and 
63*'C for 4 minutes. On completion of these cycles the reaction mix 
was made up to 120m1 by the addition of 120pmols of each end primer, 
2264 and 6HIS9E10R1BS1 (SEQ ID NOs: 48 and 41), additional dNTPs 
(approximately an extra lOOyM), Taq polymerase reaction buffer, and 4U 
of Taq polymerase. The mixture was heated at 94*»C for 10 minutes 
prior to addition of Taq enzyme, and the PCR incubation was carried 
out using 30 cycles of 94*>C for 1.5min, 50*»C for 2min, and 72*»C for 
2min, followed by a single incubation of 72*^C for 9.9min at the end of 
the reaction. 

An aliquot of the PCR product was analysed for DNA of the 
correct size (about 1000 base pairs) by agarose gel electrophoresis 
and found to contain predominantly a band of the correct size. The 
remainder of the product from the reaction mix was purified in a 
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similar manner to Reference Example 1 . The isolated DNA was 
restriction digested with enzymes Ncol and EcoRI, and a band of the 
correct size (about 1000 base pairs) purified in a similar manner to 
Reference Example 7 . 

pICI266 double stranded DNA, prepared in a similar manner to 
Reference Example 7 , was restriction digested with Ncol and EcoRI 
enzymes, being very careful to ensure complete digestion. DNA of the 
correct size (about 5600 base pairs) was purified in a similar manner 
to Reference Example 7 , 

Aliquots of both restricted and purified DNA samples were 
checked for purity and concentration estimation using agarose gel 
electrophoresis compared with known standards. From these estimates 
ligation mixes were prepared to clone the HCPB gene into the pICI266 
vector in a similar manner to Reference Example 7 . 

Following the ligation reaction the DNA mixture was used to 
transform E.coli strain DH5a, colonies were picked and tested by 
hybridisation, in a similar manner to Reference Example 7 . 

Three of the clones were then taken for plasmid DNA 
preparation, and were sequenced over the region of PCR product in a 
similar manner to Reference Example 7 . The clones were sequenced- 
using nine separate oligonucleotide primers known as 1281, 677, 1504, 
679, 1802, 1590, 1280, 1731 and 1592 (SEQ ID NOs: 37, 34, 42, 33, 45, 
43, 35, 44 and 54). From the sequencing results a clone containing a 
plasmid with the required D253R-HCPB gene sequence was selected, and 
is known as pICI1746. 

The confirmed sequence of the cloned D253R-HCPB gene cloned 
in pICI1746, showing amino acid translation, from the start of the 
PelB sequence to the EcoRI restriction site is shown as SEQ ID NO: 63 
with DNA numbering starting from 1 in the first codon of PelB, and 
peptide numbering starting from 1 in the mature HCPB. 

To obtain controlled expression of the D253R-HCPB the 
pICI1746 plasmid DNA was transformed into transformation competent 
E.coli expression strains in a similar manner to Reference Example 7 . 
All pICI1746 transformed expression strains were treated in a similar 
manner to Reference Example 7 to test for expression of the cloned 
D253R-HCPB gene. In this case the 9E10 monoclonal antibody specific 
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for the C-myc peptide tag was used in the Western analysis, as the 
D253R-HCPB has the C- terminal (His)g-c-myc tag in a similar manner to 
Reference Example 7 . 

Expression of the cloned tagged D253R-HCPB in pICI266 
(pICI1746) was demonstrated from E.coli by the Coomassle stained gels 
showing a strong protein band at about 35,000 Daltons when compared to 
vector (pICI266) alone clones, and clones producing the tagged HCPB 
(Reference Example 7). A band of the same size gave a strong signal 
by Western analysis detection of the c-myc tag. 

Purification is achieved using methodology analogous to that 
set out below in Example 3. 



Example 3 

Purification of mutant D253K HCPB- (His )g-c-Myc proteins from E. coli 

First a 20 litre fermentation process for carboxypeptidase B 
analogue D253K in a cell paste is described. E. coll K12 strain MSD 
1924 was transformed with plasraid pZen 1713 (pICI 1713; see Example 1 
above) and the resultant strain MSD 2230 (MSD 1924 pZen 1713) was 
stored in glycerol freezing mix at -80»C. 

MSD 2230 was streaked onto agar plates containing 
L-tetracycllne (lOpgml'^) medium to separate single colonies after 
overnight growth at 37»C. Six single colonies of MSD 2230 were 
removed from the surface of the L-tetracycllne (lOugml"^) agar, re 
suspended in a 10ml L-tetracycllne (lOpgral"^) broth and lOOul of this 
culture was Immediately inoculated into each of six 250ml Erlenmeyer 
flasks containing 75ml of L-tetracydine (lOygml"^) broth. After 
growth for 15-16 hours at 37''C on a reciprocating shaker (300rpm) the 
contents of the flasks were pooled and used to inoculate a single 
fermenter (U30D vessel, B. Braun, Melsungen, Germany) containing 15 
litres of the growth medium described in Figure 6. 

The fermentation was performed at a temperature of 37''C and 
pH of 6.7 and pH of 6.7 which was automatically controlled to the set 
point by the addition of 6M sodium hydroxide or 2M sulphuric acid. 
The dissolved oxygen tension (dOT) set point was SOX air saturation 
and it was maintained by the automatic adjustment of the fermenter 
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Stirrer speed between 200 and 1000 rpm. The air flow to the fermenter 
was maintained at 20 standard litres per minute which corresponds to 
1.3 vessel volumes per minute (wm) by a Tyian mass flow controller. 

4.5 hours following inoculation, a solution of yeast extract 
(225gl ) was fed into the fermenter at a rate of 190-210mlh"^ for 
28.5 hours. 1.5 hours after the yeast extract feed was started, the 
fermentation temperature set point was reduced to 25'C. When this 
temperature was attained, approximately 1 hour later, expression of 
the carboxypeptidase analogue D253K was induced with a single shot 
addition of 50% arabinose to give a final concentration in the 
fermenter vessel of 0.5Z. 1-2 hours following induction, a mixture of 
glycerol (714gl" ) and ammonium sulphate (143gl"^) was fed into the 
fermenter at 45-55mlh"^ until harvest. The fermentation was continued 
under these conditions until ca. 75 hours post fermenter inoculation 
when the culture was harvested by transferring aliquots of the 
fermenter contents into 1 litre centrifuge bottles. The spent medium 
was separated from the bacterial cells by centrifugation in a Sorvall 
RC-3B centrifuge (7,000x g, 4»C, 30min.). This process typically 
yields a final dry weight of ca.20gl"^. 

The cell paste was purified as follows. Recombinant E.coli 
cell paste containing the recombinant enzyme, D253K HCPB, was taken 
from storage at -70»C and allowed to thaw. The weight of cell paste 
was measured and found to be 309 grams. The paste was resuspended with 
the addition of buffer A (200mH Tris (hydroxymethyl)aminomethane 
hydrochloride (TRIS-HCl), 20% sucrose, pH 8.0] to give a resuspended 
volume of 320 ml. The cell suspension was incubated at room 
temperature for 20 minutes with occasional gentle mixing before an 
equal volume of distilled water, at room temperature, was added and 
thoroughly mixed in. The cell suspension was again incubated at room 
temperature for 20 minutes with occasional gentle mixing. 

The resulting crude osmotic shockate was clarified by 
centrifugation at 98000 x g for 90 minutes at 4»C after which the 
supernatant was decanted off from the pelleted insoluble fraction, 
giving a clarified volume of 240 ml. Deoxyribonuclease 1 (24mg) was 
dissolved in distilled water (5ml) and added to the supernatant. The 
mixture was incubated at room temperature, with continuous shaking for 
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30 minutes to reduce the vicosity of the supernatant enough for it to 
be loaded on to a Carbox3T3eptidase Inhibitor CNBr activated Sepharose 
affinity column, prepared according to instructions with the CNBr 
activated Sepharose 4B from Pharmacia and carboxypeptidase inhibitor 
from potato tuber (c-0279, Sigma) • The supernatant was diluted 1:1 
with lOmM TRIS-HCl, 500mM sodium chloride, pH 8.0 (Buffer B), adjusted 
to pH8.0 and loaded, over night, on to the Carboxypeptidase inhibitor 
affinity column at 0.5 ml/min. The column was pre-equilibrated with 
buffer B at 4*»C. After loading the supernatant, the column was washed 
until the absorbance of the flow through was back to baseline before 
the bound material was eluted from the column by elutlon buffer (lOOmM 
sodium carbonate, 500mM sodium chloride, pH 11.4) at 4*»C,wlth 1ml 
fractions being collected. The eluted fractions were frozen at -20'»C 
after samples were taken to determine those containing the recombinant 
carboxypeptidase. This was accomplished by Western blot analysis 
using an antl- c-myc tag antibody (9E10), followed by an anti-mouse 
-horseradish peroxidase conjugate (a-9044, sigma) that gave a colour 
reaction with exposure to 4-chloro-naphthol and hydrogen peroxide. 
Fractions 11 to 44 were determined to contain the recombinant 
carboxypeptidase B. These were pooled, the pH adjusted to pH7.5 and 
concentrated using a Millipore Centlfugal Ultrafree -20 (10,000 
molecular weight cut off) before being snap-frozen and stored at 
-20«C. The purification detailed here provided 4.7mg of D253K mutant 
carboxypeptidase at a purity of 807., in a volume of 0.95 ml. 

Example 4 

Synthesis of an aspartic acid phenol mustard prodrug (compound 5a» 
Figure 7) 

(2S),2-(3-{4-[bis-(2-chloroethyl)-amino)-phenoxycarbonyl}-propionyl- 
amino) -succinic acid 

Analogous methodology to that set out in Reference Example 4 

was used. 

(2S),2-(3-{4-[bis-(2-chloroethyl)-amlno)-phenoxycarbonyl)- 
propionylamlno) -succinic acid dibenzyl ester (4a) was hydrogenated for 
2h at 80 psi to give the desired end product 5a (yield: 86%). 
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5a: IflNMR (CD30D): 2.65-2.75 (t. 2H); 2.8-2.9 (m, 4H); 3.7-3.75 (m 
4H), 3.8-3.85 (m, AH)-, 4.75 (t, IH); 6.7-6.8 (m, 2H)5 7.0-7.1 (m. 2H). 
MS (ESI): 471-473 (MNa)* 

Anal. {C^QJl22^2°yCh ^'^ ^2°^ 

Calc. XC: 45.56 H: 5.27 N: 5.90 

Found %C: 45.79 H: 5.60 N: 5.91 

Starting material compound 4a was prepared as follows. 

(2S),2-amino-succinic acid dibenzyl ester (Compound 2a) was 
reacted to give (2S), 2- (3-carboxypropionylamino) -succinic acid 
dibenzyl ester (compound 3a) after recrystallisation with diethyl 
ether/hexane: (Yield: SOX). 

3a: IHNMR (00013): 2.42-2.6 (m, 2H); 2.6-2.75 (m, 2H), 2.85 (dd, 2H); 

3.1 (dd, 1H)5 4.9 (dd, IH); 5.05 (dd, 2H); 5.15 (s. 2H) ; 6.7 (d, IH) • 

7.25-7.5 (m, 10 H). 

MS (ESI): 436 [MNa]* 

Anal. (C22H23NO7 O.4H2O): 

Calculated %C: 62.82 H: 5.70 N: 3.33 

Found %C: 63.2 H: 5.75 N: 2.9 



3a was reacted to give the desired starting material 4a 
(yield: 78 X) (stirring was maintained for 3h at room temperature and 
purification was achieved by flash chromatography using diethyl 
ether/hexane (70/30 V/V as eluent). 

4a: IHNMR (CDCI3): 2.55-2.65 (m, 2H); 2.8-2.9 (m, 2H); 2.9 (dd, IH); 
3.1 (dd, IH); 3.6 (dd, 4H); 3.7 (dd, 4H); 4.9 (dd, IH); 5.05 (dd, 2H); 
5.15 (s, 2H); 6.58 (d, IH); 6.65 (d, 2H); 6.95 (d, 2H); 7.25-7.4 (m 
10 H). 

MS (ESI): 651-653 (MNa)* 
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Example 5 

Synthesis of a glutamic acid phenol mustard prodrug (5b; Figure 7) 
(2S) ,2- (3- {4- (bis- (2-chloroethyl) -amino )-phenoxycarbonyl} -propionyl- 
-amino}-pentanedioic acid 

Analogous methodology to that set out in Reference Example 4 

vas used. 

(2S) , 2- (3- {4- (bis- (2-chloroethyl ) -amino) -phenoxycarbonyl} - 
propionylamino)-pentanedioic acid dibenzyl ester (4b) was hydrogenated 
for 3 h at 60 psi to give the desired end product 5b (yield: 931). 
5b! IHNMR (CD3OD): 1.9-2.0 (ra, IH); 2.1-2.2 (m, IH); 2.35-2.45 (m, 
2H); 2.55-2.7 (m, 2H); 2.8-2.9 (m, 2H); 3.65-3.7 (m, 4H); 3.72-3.8 (m, 
4H); 4.4-4.5 (m, IH); 6.75 (d, 2H) ; 6.95 (d, 2H). 
MS (ESI): 485-487 (HNa)* 

Starting material compound 4b vas prepared as follows. 

« (2S),2-amino-pentanedioic acid dibenzyl ester (2b) vas 
reacted to give (2S),2-(3-carboxypropionylamino)-pentanedioic acid 
dibenzyl ester (3b) (Yields quantitative) 

3b: IHNMR (CDCI3)! 2.0-2.1 (m, IH); 2.2-2.3 (m, IH); 2.3-2.5 (m, 4H); 
2.6-2.7 (m, 2H); 4.65 (dd. IH); 5.05 (s, 2H); 5.15 (s, 2H); 6.5 (d, 
IH); 7.3-7.4 (m, 10 H). 
MS (ESI): 450 (MNa]+ 

3b was reacted to give the desired starting material 4b (yield: 82X). 
4b: IHNMR (CDCI3); 1.95-2.05 (m, IH); 2,2-2.3 (m, IH); 2.3-2.5 (m, 
2H); 2.6 (dt, 2H); 2.8-3.0 (m, 2H); 3.6 (dd, 4H); 3.7 (dd, 4H) ; 4.7 
(dd, IH); 5.1 (s, 2H); 5.2 (s, 2H) ; 6.3 (d, IH) ; 6.6 (d, 2H) ; 6.95 (d, 
2H); 7.3-7.4 (m, 10 H). 
MS (ESI) : 665-667 (MNa)* 
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Example 6 

Assay of activity of mutant human CPB and native human CPB against 
Hipp-Asp and Ripp-Glu prodrug analogues. 

Purified mutants of human CPB (D253K and D253R; Examples 
1-3) and native human CPB, produced as described in Reference Example 
12, were assayed for their ability to convert either 
hippuryl-L-aspartic acid (Hipp-Asp - Reference Example 2), 
hippuryl-L-glutamic acid (Hipp-Glu - Reference Example 1) or 
hippuryl-L-arginine (Sigma Chemical Company - cat no. H6625) to 
hippuric acid using a HPLC based assay. 

The reaction mixture (250 yl) contained 4 yg human CPB 
(native or mutant) and 0.5 mM Hipp-Asp or Hipp-Glu in 0.025 M 
Tris-HCL, pH 7.5. Samples vere incubated for 5 hr at 37«»C. The 
reactions were terminated by the addition of 250 ul of BOX methanol, 
20% distilled water, 0.2% trifluoro acetic acid and the amount of 
hippuric acid generated was quantified by HPLC. 

HPLC analysis was carried out using a Hewlett Packard 1090 
Series 11 (with diode array) HPLC system. Samples (50 pi) were 
injected onto a Hichrom Hi-RPB column (25 cm) and separated using a 
mobile phase of 40% methanol, 60% distilled water, 0.1% trifluoro 
acetic acid at a flow rate of Iml/min. The amount of product (hippuric 
acid) produced was determined from calibration curves generated with 
known amounts of hippuric acid (Sigma-H6375) . The results are shown in 
Table 1 and are expressed as the percentage conversion of substrate 
into product in 5 hr at 37"'C with 4 pg enzyme. 
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Table 1. Conversion of Hipp-Asp and Hipp-Glu by mutant and native 
human CPB 

Hipp-Asp Hipp-Glu Hipp-Arg 
(% conversion to Hippuric acid) 

Native CPB 0 0 100 

D253K mutant CPB 78 91 <2 

D253R mutant CPB 72 52 3 

The data show that introduction of either a lysine or 
arginine residue at position 253 in human CPB instead of the aspartate 
residue present in the native enzyme changes the substrate specificity 
of the enzjnne so that it is capable of conversion of either .Hipp-Asp 
or Hipp-Glu, In contrast, the native enzyme is unable to convert 
either of these compounds into Hippuric acid but does convert Hipp-Arg 
to hippuric acid. The best activity was seen with the D253K mutant and 
the Hipp-Glu substrate. 

Example 7 

Determination of Km and kcat of HCPB mutants with Hipp-Asp and 
Hipp-Glu. 

Purified D253K HCPB, IQ54R,D145A,D253KJHCPB, [G251N,D253K]HCPB and 
(G251T,D253KJHCPB were produced as described in Example 3, Reference 
Example 12, Example 27 and Example 28 respectively were assayed 
against Hipp-Asp (Reference Example 2) and/or Hipp-Glu (Reference 
Example 1) to determine Km and kcat for these substrates. Hipp-Glu and 
Hipp-Asp were diluted in range 0.25-8.0 mM and 0.25-5.0 mH 
respectively in 0.025 M Tris-HCL buffer, pH 7.5. Where necessary 
substrate samples were adjusted to pH 7.5 with IM NaOH. 

D253K HCPB (4yg/ral for Hipp-Asp and 0.5Mg/ml for Hipp-Glu), 
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IQ54R,DU5A,D253K]HCPB (1.5pg/inl f or Hipp-Glu) , [G251N,D253K]HCPB 
(0.14yg/ml for Hipp-Glu) and IG251T,D253K]HCPB (0.02Mg/ml for 
Hipp-Glu) were added to these substrates (500m1 reaction volume) to 
start the reaction. Samples were incubated for 5h at 37 'C. Reactions 
were terminated by the addition of 500pl methanol/distllled water 
(80/20) containing 0.2% TFA. The amount of hippuric acid produced was 
quantified by HPLC as described in Example 6. 

Km and Vmax values were calculated using the ENZFITTER software 
programme (Biosoft, Perkin Elmer). Kcat was calculated from Vmax by 
dividing by the enzyme concentration in the reaction mixture (using a 
molecular weight for HCPB of 34 ICDa). The results are shown in Tables 
2a and 2b. 



and kcat data for Hipp-Asp and Hipp-Glu with D253K mutant HCPB 



Table 2a 



Km (mM) 



kcat (s" ) 



kcat/Km (mM"S"^) 



Hipp-Asp 



2.7 



0.26 



0.1 



Hipp-Glu 



5.3 



3.8 



0.7 
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Table 2b 

Kb and kcat data for Hipp>Glu vith HCPB mutants 
Mutant 

IQ54R,D145A,D253K] 
(G251N,D253K] 
IG251T,D253KJ 

The data confirm that replacing aspartate with a lysine residue at 
position 253 in human CPB results in an enzyme which can convert both 
Hipp-Asp and Hipp-Glu into hippuric acid with reasonable enzyme 
kinetics. The kcat/Kra is approximately 7 fold greater with the 
Hipp-Glu compared to the Hipp-Asp substrate with D253K HCPB. 
Introduction of additional mutations increases the activity of D253K 
HCPB versus Hipp-Glu by up to 100 fold. 

Example 8 

Assay of activity of mutant HCPB and native HCPB against glutamic acid 
prodrug 

Purified D253K HCPB and native human CPB, produced as 
described in Example 3 and Reference Example 12 respectively, were 
assayed for their ability to enzymatically cleave glutamic acid from a 
glutamic acid prodrug (Example 5). Cleavage liberates an intermediate 
(Reference Example 5) which self collapses non-enzymatically to 
release the active phenol mustard drug. Conversion of the glutamic 
acid prodrug to intermediate was measured using a HPLC based assay. 

Prodrug was diluted in the range 0.25-5.0 mM in 0.025 H 
Trls-HCL buffer, pH 7.5. Where necessary prodrug samples were adjusted 
to pH 7.5 with IM NaOH. D253K mutant HCPB or native HCPB, both at a 



Km (mM) kcat (s'^) kcat/Km (mM"^s'^) 

10.6 15 1.4 

2.3 24 10 

1.1 75 68 
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final concentration of 0.25 mg/ml, were added to the these substrates 
(250m1 reaction volume prewarmed to 37*»C for 2 min) to start the 
reaction. Samples were incubated for 4 minutes at 37**C- The reaction 
was terminated by the addition of 250|il 98. 8X MeCN, 0-2% TFA and the 
samples placed on ice. The amount of intermediate produced was then 
quantified by HPLC. 

HPLC separation was carried out as described in Example 6 
except that a mobile phase of MeCN/distilled water (55/45 V/V) 
containing 0.1% TFA was used to achieve separation of the prodrug 
(retention time 4.9 minutes) and intermediate (retention time 8.4 
minutes). The amount of intermediate produced was quantified from 
calibration curves generated with known amounts of the intermediate. 

The amount of intermediate formed at 5.0 mM and 0.25 mM 
prodrug with native and mutant (D253K) HCPB in replicate samples is 
shown in Table 3. 

Table 3 



Conversion of prodrug to intermediate by native and mutant (D253K) 
HCPB. 



Prodrug concentration Intermediate concentration (mM) 

(fnM) Native HCPB Mutant HCPB 

5-0 0, 0 0.023, 0.022 

0-25 0, 0 0.005, 0.005 

Km, Vmax and kcat values for the mutant human enzyme (D253K) 
and the prodrug were calculated from the amount of intermediate 
produced over a range of substrate concentrations (0.25-5.0 mM) using 
the ENZFITTER software described in Example 7. The results for the 
D253K mutant HCPB were: 



Km = 1.25 mM 
Vmax « 1.17 X 10"*mMsec^^ 
kcat = 0.016 sec"^ 
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The data show that introduction of a lysine residue at 
position 253 in human CPB instead of the aspartate residue present in 
the native enzyme changes the substrate specificity of the enzyme so 
that it is capable of conversion of the glutamic acid prodrug into its 
self -collapsing intermediate. In contrast, the native enzyme is 
unable to convert the prodrug to its intermediate* Since the prodrug 
is relatively non-cjrtotoxic (Example 9) and the intermediate is 
non-enzymatically broken down to release free phenol mustard drug 
which kills tumour cells (Example 9) these results demonstrate that 
mutation of active site residues of CPB can yield a mutant human 
enzyme capable of converting a relatively non-cytotoxic prodrug into a 
potent cytotoxic drug capable of killing tumour cells. 

Example 9 

Cytotoxicity of glutamic acid prodrug and phenol mustard drug in LoVo 
human colorectal tumour cells. 

The differential cytotoxicity to tumour cells of the 
glutamic acid prodrug (Example 5) and corresponding phenol mustard 
drug (Figure 7, Compound 6) has been demonstrated by the following 
means • 

LoVo colorectal tumour cells were incubated with prodrug or 
drug over a final concentration range of 5 X 10"^ to 5 X 10"^M in 
96-well (2,500 cells/well) microtitre plates for Ih at 37**C. The cells 
were then washed and incubated for a further three days at 37*C. After 
washing to remove dead cells, TCA was then added and the amount of 
cellular protein adhering to the plates was assessed by addition of 
SRB dye as described by ?• Skehan et al, J, Natl, Cancer Inst. 82, 
1107 (1990). Potency of the compounds was assessed by the 
concentration required to inhibit cell growth by 505: (IC^q). 

Upon treatment of LoVo cells with the phenol mustard drug an 
ICjQ of approximately lyH was seen. In contrast the glutamic acid 
prodrug was much less cytotoxic with an IC^q of approximately 50pM 
(Figure 5). Thus the mutant CPB glutamic acid prodrug is approximately 
50 fold less cytotoxic to tumour cells than the phenol mustard drug. 
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If lOOjjg of mutant HCPB (D253K) produced as described in 
Example 3 is added to the assay wells containing the glutamic acid 
prodrug cytotoxicity can be seen which is comparable to that of the 
active drug thus demonstrating conversion of the prodrug by the mutant 
enzyme to release the more potent drug. Addition of lOOng of native 
human CPB to each well does not significantly enhance the cytotoxicity 
of the glutamic acid prodrug. These studies demonstrate the potential 
of the mutant human CPB enzyme (D253K) to selectively convert a 
relatively inactive prodrug into a potent cytotoxic drug capable of 
killing tumour cells. 

Example 10 

Preparation of humanised A5B7 F(ab')2-D253K HCPB fusion protein 

The procedure described in Reference Example 13 is repeated 
but with murine A5B7 light chain and Fd sequences replace by sequences 
for humanised A5B7, and with the HCPB sequence replaced by D253K 
sequence. The 8 amino acid linker sequence described in Reference 
Example 13 f) is replaced by the equivalent human sequence, APPVAGPS 
(SEQ ID NO: 66). The fusion protein is expressed from COS cells by 
co-transfection with the HCPB prepro sequence as described in 
Reference Example 13. Large-scale expression of the fusion protein is 
performed by transiently introducing the plasmid vectors (750ng of 
each) into COS-7 cells (11) essentially as described in Reference 
Example 13. The product is purified either by passing the supernatant 
containing the fusion protein over immobilised protein A and elution 
of the bound fusion protein with high pH buffer or by passing the 
supernatant containing the fusion protein over immobilised 
carboxypeptidase inhibitor, following the route used for the 
purification of the recombinant carboxypeptidase enzyme, and elution 
with the same high pH as used with the enzyme in Example 3. Both 
these routes may involve further purification of the fusion protein by 
either gel permeation chromatography, ion exchange chromatography, 
hydrophobic interaction chromatography singly, or a combination of 
them. 
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The procedure described in Reference Example 13 Is repeated 
but the murine sequences for Fd and light chain, as shown in SEQ ID 
NOS 20 and 22 respectively, are replaced by the humanised sequences 
shown in SEQ ID NOs 24 and 26 respectively. The HCPB sequence in 
Reference Example 13 is replaced by the D253K sequence [described in 
Example 1, but without the (His)g-c-Myc tags]. The template for PCR 
in Reference Example 13 (pICI1698) is replaced by 
pICI1713 (described in Example 1). 

The humanised sequences shown in SEQ ID NOs 24 and 26 are 
prepared by a variety of methods including those described by Edwards 
(1987) Am. Biotech. Lab. 5, 38-44, Jayaraman et al. (1991) Proc. Natl. 
Acad. Sci. USA 88, 4084-4088, Foguet and Lubbert (1992) Biotechniques 
13, 674-675 and Pierce (1994) Biotechniques 16, 708. 

Example 11 

Shake flask fermentation for preparation of D253K HCPB 

E.coli strain MSD 213 was transformed with plasmid pICI 1713 
(see Example 1) and the resultant strain MSD 213 pZen 1713 stored as a 
glycerol stock at -eO-C. An aliquot of MSD 213 pZen 1713 was streaked 
onto agar plates of L-tetracycllne to separate single colonies after 
overnight growth at 37''C. A single colony of MSD 213 pZen 1713 was 
removed and inoculated into a 250ml Erlenmeyer flask containing 75ml 
of L-tetracycline broth. After growth for 16h at 37''C on a 
reciprocating shaker the contents of the flask were used to inoculate 
to OD550 = 0.1 each of nine 2L Erlenmeyer flasks containing 600ml of 
L-tetracycline broth. The flasks were then incubated at 20»C on a 
reciprocal shaker until growth, estimated by measuring the optical 
density of the culture, reached OD550 = 0.5. At this point 
heterologous protein production was induced by adding L-arabinose to 
the cultures to a final concentration of 0.01%w/v and the incubation 
continued at 20''C as described above for a further 42h. The spent 
medium was separated from the bacterial cells by centrifugation in a 
Sorvall RC-3B centrifuge (7000x g, 40c, 30min) and the cell paste 
stored at -70"*C. 
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Example 12 

Cloning and expression of [G251N,D253RJHCPB-(His)g-c-l!yc from E. coli 

The method of cloning and expressing the IG251N,D253R]HCPB 
in E.coli was very similar to the method described in Reference 
|xan!Ele_8. The gene for IG251N,D253R]HCPB was prepared as described 
in Example 2 but the starting material for PCR site directed 
mutagenesis was the D253R HCPB gene in plasmid pICI1764 (described in 
ISSJSEle^) in place of pICI1712. However, in this case site directed 
mutagenesis was used during the PCR amplification of the gene to 
change the codon at amino acid position 251 in the mature gene from 
Glycine to Asparagine (GGC to AAC), the G251N change. 

Two PCR mixtures were prepared, in a manner similar to that 
described in Reference Examples 7 and 8. Jn the first reaction 
primers were 2264 (SEQ ID NO: 48) and 1038 (SEQ ID NO: 68, replacing 
SEQ ID NO: 61). In the second reaction primers were 6HIS9E10R1BS1 
(SEQ ID NO: 41) and 1043 (SEQ ID NO: 69, replacing SEQ ID NO: 62). In 
both reactions the starting DNA was pICI1764. 

Primers 1038 and 1043 (SEQ ID NOs: 68 and 69) are designed 
to anneal around amino acid codon 251, introduce the GGC to AAC change 
in the DNA sequence, and produce complementary sequence at the ends of 
the two PCR products. 

The confirmed sequence of the cloned (G251N,D253R]HCPB gene 
cloned in pMC12.5.4, showing amino acid translation, from the start of 
the PelB sequence to the EcoRI restriction site is similar to that 
shown as SEQ ID NO: 63 with DNA numbering starting from 1 in the first 
codon of PelB, and peptide numbering starting from 1 in the mature 
HCPB with the exception that amino acid number 251 is changed to an 
Asparagine (Asn) and the associated codon changed to AAC. 

To obtain controlled expression of the IG251N,D253RIHCPB the 
PMC12.5.4 plasmid DNA was transformed into transformation competent 
E.coli expression strain MSD213 in a similar manner to Reference 
Example 7. The pMC12.5.4 transformed expression strain has been 
treated in a similar manner to Reference Example 7 to test for 
expression of the cloned [G251N,D253R|HCPB gene. In this case the 
9E10 monoclonal antibody specific for the C-myc peptide tag was used 
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The procedure described in Reference Example 13 is repeated 
but the murine sequences for Fd and light chain, as shown in SEQ ID 
NOS 20 and 22 respectively, are replaced by the humanised sequences 
shown in SEQ ID NOs 24 and 26 respectively. The HCPB sequence in 
Reference Example 13 is replaced by the D253K sequence [described in 
Example 1, but without the (His)g-c-Myc tagsj. The template for PCR 
in Reference Example 13 (pICI1698) is replaced by 
pICI1713 (described in Example 1). 

The humanised sequences shown in SEQ ID NOs 24 and 26 are 
prepared by a variety of methods including those described by Edwards 
(1987) Am. Biotech. Lab. 5, 38-44, Jayaraman et al. (1991) Proc. Natl. 
Acad. Sci. USA 88, 4084-4088, Foguet and Lubbert (1992) Biotechniques 
13, 674-675 and Pierce (1994) Biotechniques 16, 708. 

Example 11 

Shake flask fermentation for preparation of D253i: HCPB 

E.coli strain MSD 213 was transformed with plasmid pICI 1713 
(see Example 1) and the resultant strain MSD 213 pZen 1713 stored as a 
glycerol stock at .80*»C. An aliquot of MSD 213 pZen 1713 was streaked 
onto agar plates of L-tetracycline to separate single colonies after 
overnight growth at 37^C. A single colony of MSD 213 pZen 1713 was 
removed and inoculated into a 250ml Erlenmeyer flask containing 75ml 
of L-tetracycline broth. After growth for 16h at 37*C on a 
reciprocating shaker the contents of the flask were used to inoculate 
to OD550 =0.1 each of nine 2L Erlenmeyer flasks containing 600ml of 
L-tetracycline broth. The flasks were then incubated at 20^C on a 
reciprocal shaker until growth, estimated by measuring the optical 
density of the culture, reached OD550 « 0.5. At this point 
heterologous protein production was induced by adding L-arabinose to 
the cultures to a final concentration of 0.01%w/v and the incubation 
continued at 20**C as described above for a further 42h. The spent 
medium was separated from the bacterial cells by centrifugation in a 
Sorvall RC-3B centrifuge (7000x g, 4«C, 30min) and the cell paste 
stored at -70^*0. 
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Example 12 

Cloning and expression of [G251N,D253RJHCPB-(His)g-c-«yc from E. coll 

The method of cloning and expressing the IG251M,D253RJHCPB 
in E.coli was very similar to the method described in Reference 
Ex§!!!Ele_8- The gene for [G251N,D253R]HCPB was prepared as described 
in Example 2 but the starting material for PCR site directed 
mutagenesis was the D253R HCPB gene in plasmid pICI1764 (described in 
Example 2 ) in place of picn712. However, in this case site directed 
mutagenesis was used during the PCR amplification of the gene to 
change the codon at amino acid position 251 in the mature gene from 
Glycine to Asparagine (GGC to AAC), the G251N change. 

Two PCR mixtures were prepared, in a manner similar to that 
described in Reference Ex amples 7 and 8 . In the first reaction 
primers were 2264 (SEQ ID NOs 48) and 1038 (SEQ ID NO: 68, replacing 
SEQ ID NO: 61). In the second reaction primers were 6HIS9E10R1BS1 
(SEQ ID NO: 41) and 1043 (SEQ ID NO: 69, replacing SEQ ID NO: 62). In 
both reactions the starting DNA was pICI1764. 

Primers 1038 and 1043 (SEQ ID NOs: 68 and 69) are designed 
to anneal around amino acid codon 251, introduce the GGC to AAC change 
in the DNA sequence, and produce complementary sequence at the ends of 
the two PCR products. 

The confirmed sequence of the cloned [G251N,D253R]HCPB gene 
cloned in pMC12.5.4, showing amino acid translation, from the start of 
the PelB sequence to the EcoRI restriction site is similar to that 
shown as SEQ ID NO: 63 with DNA numbering starting from 1 in the first 
codon of PelB, and peptide numbering starting from 1 in the mature 
HCPB with the exception that amino acid number 251 is changed to an 
Asparagine (Asn) and the associated codon changed to AAC. 

To obtain controlled expression of the [G251N,D253RIHCPB the 
PMC12.5.4 plasmid DNA was transformed into transformation competent 
E.coli expression strain HSD213 in a similar manner to Reference 
Example 7. The pMC12.5.4 transformed expression strain has been 
treated in a similar manner to Reference Example 7 to test for 
expression of the cloned [G251N,D253R1HCPB gene. In this case the 
9E10 monoclonal antibody specific for the C-myc peptide tag was used 
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in the Western analysis, as the [G251N,D253R1HCPB has the C-terminal 
(His)g-c-myc tag in a similar manner to Reference Example 7 « 

Expression of the cloned tagged (G251N,D253R]HCPB in pICI266 
(pMC12.5.4) was demonstrated from E.coli by the Coomassie stained gels 
showing a strong protein band at about 35,000 Daltons when compared to 
vector (pICI266) alone clones, and clones producing the tagged HCPB 
( Reference Example 7 ). A band of the same size has given a strong 
signal by Western analysis detection of the c-myc tag* 

Example 13 

Cloning and expression of IG251M,D253KJHCPB-(His)g-c-Myc from E. coli 

The method of cloning and expressing the [G251N,D253K1HCPB 
in E.coli was very similar to the method described in Reference 
Example 8. The gene for [G251N,D253K]HCPB was prepared as described 
in Example 2 but the starting material for PGR site directed 
mutagenesis was the D253K HCPB gene in plasmid pICI1713 (described in 
Example 1) in place of pICI1712- However, in this case site directed 
mutagenesis was used during the PCR amplification of the gene to 
change the codon at amino acid position 251 in the mature gene from 
Glycine to Asparagine (GGC to AAC), the G251N change- Two PCR 
mixtures were prepared, in a manner similar to that described in 
Reference Examples 7 and 8 , In the first reaction primers were 2264 
(SEQ ID NO: 48) and 2261 (SEQ ID NO: 70, replacing SEQ ID NO: 61). In 
the second reaction primers were 6HIS9E10R1BS1 (SEQ ID NO: 41) and 
2260 (SEQ ID NO: 71, replacing SEQ ID NO: 62). In both reactions the 
starting DNA was pICI1713. Primers 2261 and 2260 (SEQ ID NOs: 70 and 
71) are designed to anneal around amino acid codon 251, introduce the 
(SGC to AAC change in the DNA sequence, and produce complementary 
sequence at the ends of the two PCR products. 

The confirmed sequence of the cloned IG251N,D253K]HCPB gene 
cloned in pMC43.1, showing amino acid translation, from the start of 
the PelB sequence to the EcoRI restriction site is similar to that 
shown as SEQ ID NO: 59 with DNA numbering starting from 1 in the first 
codon of PelB, and peptide numbering starting from 1 in the mature 
HCPB with the exception that amino acid number 251 is changed to an 
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Asparagine (Asn) and the associated codon changed to AAC. 

To obtain controlled expression of the [G251N,D253KJHCPB the 
pHC43.1 plasmld DNA was transformed into transformation competent 
E.coli expression strain MSD213 in a similar manner to Reference 
Example 7 > The pMC43,l transformed expression strain has been treated 
in a similar manner to Reference Example 7 to test for expression of 
the cloned IG251N,D253KJHCPB gene. In this case the 9E10 monoclonal 
antibody specific for the C-myc peptide tag was used in the Western 
analysis, as the IG251N,D253KJHCPB has the C-terminal (His)g-c-myc tag 
in a similar manner to Reference Example 7 , 

Expression of the cloned tagged [G251N,D253KJHCPB in pICI266 
(pMC43.1) Was demonstrated from E.coli by the Coomassie stained gels 
showing a strong protein band at about 35,000 Dal tons when compared to 
vector (pICI266) alone clones, and clones producing the tagged HCPB 
( Reference Example 7 )> A band of the same size has given a strong 
signal by Western analysis detection of the c-myc tag. 

Example 14 

Cloning and expression of [G251T,D253KJHCPB-(His)g-c-Myc from £• coli 

The method of cloning and expressing the IG251T,D253K]HCPB 
in E.coli was very similar to the method described in Reference 
Example 8 . The gene for [G251T,D253KJHCPB was prepared as described 
in Example 2 but the starting material for PGR site directed 
mutagenesis was the D253K HCPB gene in plasmid pICI1713 (described in 
Example 1 ) in place of pICI1712. However, in this case site directed 
mutagenesis was used during the PGR amplification of the gene to 
change the codon at amino acid position 251 in the mature gene from 
Glycine to Threonine (GGC to ACT), the G251T change. Two PGR mixtures 
were prepared, in a manner similar to that described in Reference 
Examples 7 and 8 . In the first reaction primers were 2264 (SEQ ID NO: 
48) and 1038 (SEQ ID NO: 68, replacing SEQ ID NO: 61). In the second 
reaction primers were 6HIS9E10R1BS1 (SEQ ID NO: 41) and 2659 (SEQ ID 
NO: 72, replacing SEQ ID NO: 62). In both reactions the starting DNA 
was pICI1713. Primers 1038 and 2659 (SEQ ID NOs: 68 and 72) are 
designed to anneal around amino acid codon 251, introduce the GGC to 
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ACT change in the DNA sequence, and produce complementary sequence at 
the ends of the two FCR products. 

The confirmed sequence of the cloned [G251T,D253K]HCPB gene 
cloned in pMC46.4.1, shoving amino acid translation, from the start of 
the PelB sequence to the EcoRI restriction site is similar to that 
shown as SEQ ID NO: 59 with DNA numbering starting from 1 in the first 
codon of FelB, and peptide numbering starting from 1 in the mature 
HCPB vith the exception that amino acid number 251 is changed to an 
Threonine (Thr) and the associated codon changed to ACT. 

To obtain controlled expression of the [G251T,D253KJHCPB the 
pMC46.4.1 plasmid DNA was transformed into transformation competent 
E.coli expression strain MSD213 in a similar manner to Reference 
Example 7 . The pHC46.4.1 transformed expression strain has been 
treated in a similar manner to Reference Example 7 to test for 
expression of the cloned [G251T,D253K]HCPB gene. In this case the 
9E10 monoclonal antibody specific for the C-myc peptide tag was used 
in the Western analysis, as the [G251T,D253K)HCPB has the C-terminal 
(His)g-c-myc tag in a similar manner to Reference Example 7 , 

Expression of the cloned tagged [G251T,D253K]HCPB in pICI266 
(pMC46.4.1) was demonstrated from E.coli by the Coomassie stained gels 
showing a strong protein band at about 35,000 Daltons when compared to 
vector (pICI266) alone clones, and clones producing the tagged HCPB 
( Reference Example 7 ). A band of the same size has given a strong 
signal by Western analysis detection of the c-myc tag. 

Example 15 

Cloning and expression of [G251N,D253K,T266G]HCPB-(His)g-c-Myc from E. 
coll 

The method of cloning and expressing the 
[G251N,D253K,T266GIHCPB in E.coli was very similar to the method 
described in Reference Example 8 . The gene for 

(G251N,D253K,T266GJHCPB was prepared as described in Example 2 but the 
starting material for PCR site directed mutagenesis was the 
[G251N,D253K1HCPB gene in plasmid pMC43.1 (described in Example 131 in 
place of pICI1712. However, in this case site directed mutagenesis 
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vas used during the FCR amplification of the gene to change the codon 
at amino acid position 266 in the mature gene from Threonine to 
Glycine(ACC to GGC), the T266G change. Two PGR mixtures vere 
prepared y in a manner similar to that described in Reference Examples 
7 and 8 > In the first reaction primers vere 2264 (SEQ ID NO: 48) and 
1045 (SEQ ID NO: 73, replacing SEQ ID NO: 61). In the second reaction 
primers were 6HIS9E10R1BS1 (SEQ ID NO: 42) and 55 (SEQ ID NO: 74, 
replacing SEQ ID NO: 62). In both reactions the starting DNA vas 
PMC43.1. 

Methods of PGR, cloning, expression and identification vere 
the same as for Example 14. From the sequencing results a clone 
containing a plasmid with the required [G251N,D253K,T266G]HCPB gene 
sequence vas selected, and is known as pHC47.1. 

Example 16 

Expression of other Mutant HCPBs vith peptide tag using E.coli 

A number of other mutant HCPBs have been constructed in a 
similar manner to the methods described in Reference Examples 7 and 8 « 
and Examples 12, 13, 14 and 15 > In each case oligos vere constructed 
to introduce specific changes in the gene sequence by methods similar 
to those described above. In some cases additional mutations vere 
identified upon complete sequencing of the mutated gene, and these 
vere presumably introduced during the PGR reactions. Mutants vith 
enz}mie activity against substrate analogues (see Example 18 belov) 
have been identified, and examples of these are given in Table 4 
belov. Enzjrme activity is shown relative to that found from pIGI1713 
(D253K HCPB) or pICI1746 {D253R HCPB) in E.coli MSD213 by release of 
the periplasmic protein fraction by osmotic shock. This material is 
subsequently referred to as periplasmic shockate or shockate and is 
prepared by the following procedure. 

1. A single colony was used to inoculate 75ml of L-broth nutrient 
media containing lOyg/ml tetracycline and arabinose to a final 
concentration of 0.01% (w/v). Incubation was performed at 20**C with 
shaking (250rpm) for about 60 hours. 
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2. The cells were then harvested by centrifugation at 4*»C» 

3. The cell pellet was re-suspended in 300micro-litres of 20% sucrose 
(w/v) containing ImH EDTA in 200mM Tris-HCl at pH7,5, and incubated at 
room-temperature for 15 minutes. 

4. Periplasmic shockate was generated by the addition of 
450micro-litres of distilled water, and a further incubation at 
room-temperature for 15 minutes. 

5. Cellular remains were removed by centrifugation, and the 
supernatant assayed for enzyme activity as soon as possible after 
preparation and kept at 4^*0 prior to assay. 

Table 4 

Activity relative to D253K HCPB or D253R HCPB in crude E.coli 
periplasmic shockate 

Mutation Hipp-Glu Hipp-Asp Hipp-Arg 



[Q54R,D145A,D253K1 


a 


a 


II245S,D253K] 


a 


a 


II245A,D253KJ 


a 


a 


II245H,D253K] 


a 


NAD 


IA248H,D253K] 


a 


NAD 


C288S 






C288A 






(G251K,D253R] 


a 


NAD 


[I201S,D253KJ 


NAD 


b 


[G251Q,D253K1 


a 


NAD 


[G251S,D253K] 


e 


a 


(G251V,D253K] 


a 


NAD 


IA248N,G251S,D253K] 


a 


a 


[A248S,G251S,D253K] 


a 


NAD 


[I201T,D253RI 


b 


b 
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a « activity equivalent to D253K HCPB 
b e activity equivalent to D253R HCPB 

c « activity 75% of mature HCPB (described in Reference Examples 7 and 8) 
d « activity 25% of mature HCPB (described in Reference Examples 7 and 8) 
e « >10 times the activity of D253K HCPB 
NAD a No activity detected 

Example 17 

Assay of activity of a range of human CPB mutants against Hipp-Glu, 
Hipp-Asp and Hipp-Arg prodrug analogues. 

This example builds on a range of mutants described in 

Example 6. 

Purified mutants of human CPB (D253K; IG251K,D253R] ; 

[G251N,D253R]; [12018, D253K] ; [G251N,D253K] ; IQ54R,D145A,D253KJ ; 

[G251T,D253K]; (G251S,D253K) ; IA248N,G251N,D253K] ; IA248S,G251N,D253K1 
and [S205N,G251N,D253K] - described in Examples 3, 12, 13, 16, 28, 30, 
31, 32 and 33) were assayed for their ability to convert 
hippuryl-L-glutamic acid (Hipp-Glu - Reference Example 1), 
hippuryl-L-aspartic acid (Hipp-Asp - Reference Example 2) and 
hippuryl-L-arginine (Sigma Chemical Company - cat no. H6625) to 
hippuric acid using a HPLC based assay similar to that described in 
Example 6* 

The reaction mixture (500^1) contained mutant human CPB 
(0.01 - 12.5 Mg depending on mutant) and 0.5mM Hipp-Glu or Hipp-Asp or 
Hipp-Arg in 0.025M Tris-HCL, pH7.5. Samples were incubated for 30 
minutes at 37^C. The reactions were terminated by the addition of 
500m1 of 40% methanol, 60% distilled water, 0.2% trifluoro acetic acid 
and the amount of hippuric acid generated was quantitated by HPLC as 
described in Example 6 but using a mobile phase of 20% methanol, 80% 
50mM phosphate buffer, pH 6.5. Hippuric acid was detected at 230nm. 
The results are shown in Table 5 and are expressed as the percentage 
conversion of substrate into product in 30 minutes at 37**C. 
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Table 5 

Conversion of Hipp-Glu, Hipp-Asp 

Mutant Concentration 

(Mg/ml) 

D253K 25 

2.5 

IG251K,D253R] 25 

2.5 

[G251N,D253R] 25 

2.5 

[I201S,D253KI 25 

2.5 

IG251N,D253KJ 25 

2.5 

[G251R,D145A,D253K] 25 

2.5 

[G251T,D253KJ 25 

0.02 

IG251S,D253K] 25 

0.25 

IA248N,G251N,D253K] 25 

0.25 
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or Hipp-Arg by HCPB mutants. 



Substrates 

Hipp-Glu Hipp-Asp Hipp-Arg 
(% conversion) 

79 22 0 

14.6 2.9 0 

5.5 0.2 0 
3.2 0.2 0 

57 9.4 0 

8.8 1.4 0 

27.8 33.1 0 

4.6 5.8 0 

100 8.3 1.8 

62.4 1.9 0.4 

90 30 0 

26 5.2 0 

100 14 5.2 

8.2 0 0 

100 2.8 0.8 

14.5 0 0 

100 2.5 1.4 

25.3 0.6 0 
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Mutant 



Concentration 
(pg/ml) 



Substrates 
Hipp-Glu Hipp-Asp Hipp-Arg 

(2 conversion) 



IA248S,G251N,D253K] 25 

0.5 



86 

10.7 



1.3 
0 



0 
0 



[S205N,G251N,D253K] 25 

0.5 



85 
7.9 



0.48 
0 



0.2 

0 



The data demonstrate that all 11 mutants have the ability to 
turn over Hlpp-Glu and Hipp-Asp substrates and all show minimal or no 
ability to convert Hipp-Arg (substrate for native human CPB, Example 
6). The best mutants in this assay were (G251T,D253K] for Hipp-Glu and 
II201S,D253KJ for Hipp-Asp. 

Example 18 

Assay of activity of [G251T,D253KIHCPB and other HCPB mutants. 

The activity of IG251T,D253KJHCPB was measured using the 
assay conditions described in Example 17 but using either 50yl of a 
neat or 1 in 10 or 1 in 100 dilution of IG251T,D253KJHCPB crude 
E.coli periplasmic shockate (shockate) sample in place of purified 
enzyme. Samples were incubated for 24h at 37 »C. For comparison D253K 
HCPB was assayed in a similar assay except that the reaction volume 
was reduced to 250^1 and contained 125m1 neat D253K shockate sample. 
For other HCPB mutants, the reaction volume was 250^1 and contained 
125m1 of neat shockate or 1 in 50 dilution of shockate. The shockate 
samples were prepared as described in Example 16. The amount of 
Hippuric acid produced in 24h was quantified by HPLC as described in 
Example 17 and the results are shown in Table 6 and are expressed as 
the percentage conversion of substrate into product in 24h at 37"'C. 
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Conversion of Hipp-Glu, Hipp-Asp and Hipp-Arg by [G251T,D253K]HCPB and 
otber HCPB mutants 



Mutant Shockate concentration Substrates 

in reaction mixture Hipp-Glu Hipp-Asp 



(X) 



Hipp-Arg 



(2 conversion) 



ID253KJ 



50 



7.4 



2.0 



IG251T,D253K] 



10 
1 

0.1 



100 
93 
60 



2.6 
0 



0.8 
0.1 



[G251N,D253K,T266G| 50 

[G251S,D253K] 50 

1 



0.95 

93.8 
5 



3.4 
1.6 



[A248N,G251T,D253KJ 50 

1 



95 

53.5 



[A248S,G251T,D253KJ 50 

1 



91.3 
20 



0.5 



IG251T,D253R] 



50 



IA248N,G251N,D253K] 50 



[A248S,G251N,D253K] 50 



IS205N,G251N,D253K] 50 



65.4 
63.9 
46.1 
14.1 



14.6 
0 
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The data demonstrate that the mutant [G251T,D253K] is at 
least 50 fold more active than D253K at converting Hipp-Glu to 
hippuric acid. The [G251T,D253K1 mutant is over 900 fold more active 
against Hipp-Glu compared to Hipp-Arg (substrate for native CPB, 
Example 6). 

Example 19 

cytotoxicity of prodrug of Example 21 and corresponding drug of 
Example 22 in LoVo tumour cells. 

The differential cytotoxicity of the prodrug of Example 21 
and the drug of Example 22 in LoVo human colorectal tumour cells was 
demonstrated as described in Example 9. 

LoVo tumour cells treated with the prodrug had an IC50 of 
905uM while the cells treated with the drug had an IC50 of 84pM (mean 
data from 3 separate studies). A representative study is shown in 
Figure 15. Thus the prodrug was over 10 fold less C3rtotoxic to LoVo 
colorectal tumour cells than the drug demonstrating its utility for 
use with suitable mutants of HCPB described herein. 

Vhen the D253K HCPB mutant, produced as described in 
Example 3, was added to the assay wells containing LoVo tumour cells 
and the prodrug of Example 21 enhanced cell kill was seen. Addition of 
between 2.4 and 11.75 yg/ml D253K HCPB to the prodrug (500mM) resulted 
in toxicity which matched that seen with 200yM of the active drug of 
Example 22 (Figure 16). These studies further demonstrate the 
potential of mutant enzjmies of human CPB to selectively convert a 
relatively non-cytotoxic prodrug into a potent cytotoxic drug capable 
of killing tumour cells. 

Example 20 

Conversion of prodrug of Example 24 by D253K HCPB and other HCPB 
mutants . 

The ability of purified D253K HCPB (Example 3) to convert 
the prodrug of Example 24 to the drug of Example 25 was demonstrated 
as follows. 
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A reaction mixture (500pl) containing D253K HCPB (7.5ug), 
O.SmM prodrug in 0.025M Tris-HCl buffer, pH 7.5 was incubated for 5 
minutes at 37»C. The reaction was stopped by adding MeCN (500^1) plus 
0.2X Irifluoroacetic acid. The amount of drug produced was then 
quantified by HPLC. 

HPLC separation was carried out as described in Example 6 
except that a mobile phase of 70% HeCN, 30X distilled water and O.U 
trlfluoroacetic acid was used to achieve separation of prodrug 
(retention time 3.8 minutes) and drug (retention time 4.9 minutes) and 
the compounds were detected at 260nm. The amount of drug produced was 
quantified from calibration curves generated with known amounts of 
drug. 

D253K HCPB (15Mg/ml) resulted in hydrolysis of 70.4 X of 
prodrug to drug in this assay in 5 minutes at 37"C (drug concentration 
at end of reaction was 0.352mM). 

The conversion of prodrug to drug by other HCPB mutants 
using the same assay conditions is shown in Table 7. The amount of 
(Q54R,D145A,D253K)HCPB and [G251T,D253KJHCPB mutants was reduced to 
0.75Mg in the reaction mixture. 

Table 7 

Conversion of prodrug of Example 24 by HCPB mutants 

Mutant Concentration Prodrug Hydrolysis 

(Pg/ml) {%) 

D253K 15 70.4 

D253R 15 18.3 

[G251K,D253R] 15 16.2 

IG251N,D253RI 15 66.7 

II201S,D253KJ 15 72.6 
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Mutant Concentration Prodrug Hydrolysis 

(Vig/ml) (2) 

[G25IN,D253K] 15 75.9 

[Q54R,D145A,D253K1 1.5 26.0 

[G251T,D253K] 1.5 36.0 

The data demonstrate that HCPB mutants can convert prodrug 
to drug providing further evidence that the activity seen against the 
model substrates Hipp-Glu and Hipp-Asp is applicable to mustard 
prodrugs. 

Example 21 

Preparation of 

N- [N- ( 4- {4- [bis- ( 2-chloroethyl) -amino J -3-methyl-phenoxy) -benzoyl) -L-al 
anine]-L-glutamic acid (see Figure 17 for reaction scheme) 

To a solution of 

N- [N-(4- {4- [bis- (2-chloroethyl) -amino] -3-methyl-phenoxy) -benzoyl)-L- 
alanine-L-glutamic acid dibenzyl ester (compound 8; 130 mg) in ethyl 
acetate (5 mL) was added 30X Pd/C (50X moist; 25mg). The mixture was 
stirred under an atmosphere of hydrogen for 1 hour. The catalyst was 
removed by filtration through CELITE (diatomaceous silica) and the 
filtrate evaporated to dr5mess to give the titled compound (compound 
11) as an oil, 88 mg (88% yield). 

NMR DMSOdg 7.9-6.6 (m, 7H) ; 4.85 (m, IH) ; 4.6 (m, IH); 3.4 (m, 8H); 
2.25 (s, 3H); 2.4-1.9 (m, 4H); 1.45 (d, 3H); 
MS ESI, 566 [M-H]* 

Starting material compound 8 was made as follows. 
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4-hydroxybenzoic acid (13.8 g, 0.1 mole) was dissolved In 
methanol and to this solution was added, sodium methoxide (10.8 g» 
0.2 mole). The solution vas then evaporated to dryness. To the 
resulting solid vas added DMF (500 mL) followed by 

4- fluoro-2-methyl-l-nitrobenzene (available from Aldrich under 

5- fluoro-2-nitro-toluene) (10.2 mL 0.1 mole). The mixture was heated 
at 125'*C for 2 hours, cooled and poured into 3L of water, acidified to 
pH2 with 2M HCl and extracted twice with ethyl acetate. The combined 
organic layers were washed with water, dried and evaporated to 
dryness. The resulting solid was triturated with ether to give 

4- (3-methyl-4-nitro-phenoxy) -benzoic acid (compound 1) as a white 
solid 6.1g (22% yield; melting point - 187-190**C). 

To a solution of isobutylene (34 g) in dichloromethane (100 
mL) was added compound 1 (5g), at S^'C, followed by concentrated 
sulphuric acid (0.5 mL). The mixture was stirred at ambient 
temperature for 2 days and poured into saturated sodium bicarbonate 
solution (200 mL). The aqueous layer was extracted with 
dichloromethane and the combined organic extracts were dried and 
evaporated to give an an oil. The oil was chromatographed with 5X 
ethyl acetate in hexane to give 4- (3-methyl-4-nitro-phenoxy) -benzoic 
acid tert-butyl ester (compound 2), 2.5 g (42% yield; melting point « 
81-83*»C). 

To a solution of compound 2 (2.15 g 5 mM) in ethyl acetate 
(35 mL) was added 30% Pd/C (50% moist) (400 rag). The mixture was 
stirred under an atmosphere of hydrogen for 2 hours. The mixture was 
filtered through CELITE and the filtrate evaporated to give 
4- (4-amino-3-methyl-phenoxy) -benzoic acid tert- butyl ester (compound 
3) as a solid (1.9g, 99% yield). Melting point, 84.86*C. 

To a solution of compound 3 (2g) in 1:1 acetic acid/water 
(30 mL) was passed in ethylene oxide (5g). After standing at ambient 
temperature for 2 days the mixture was poured into saturated sodium 
bicarbonate solution (200 mL) and extracted twice with ethyl acetate. 
The combined organic extracts were washed with water, dried and 
evaporated to give 

4- { 4- [ bis ( 2 -hydroxyethyl ) - amino ] - 3 -methyl-phenoxy) -benzoic acid 
tert -butyl ester (compound 4) as an oil (2.5g, 99% yield). 
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NMR CDCI3 8.0 (d, 2H); 7.3-6-9 (m, 7H); 3.6 (t, 4H); 3.2 (t, 4H); 2.3 
(s, 3H); 1.5 (s, 9H). 

To a solution of compound 4 (2.5 g) in 1:1 
acetonitrile/carbon tetrachloride (90 mL) was added imidazole (1.7 g) 
and triphenylphosphene (6.6 g). The mixture was stirred at ambient 
temperature for 3 hours and evaporated to drsmess. The residue vas 
partitioned between IM citric acid solution and ethyl acetate. The 
organic layer vas washed with water, dried and evaporated to drjmess. 
The residue was chromatographed with hexane/ethyl acetate (9:1) to 
give 4- {4- [bis (2-chloroethyl) -amino) -3-methyl-phenoxy) -benzoic acid 
tert-butyl ester (compound 5) as an oil (1.2g, 44% yield). 
NMR CDCI3 7.95 (d, 2H); 7.2-6.9 (m, 7H); 3.4 (m, 8H); 2.3 (s, 3H); 1.6 
(s, 9H). 

A solution of compound 5 in dichloromethane (5 ml) and 
trifluoroacetic acid (10 mL) was allowed to stir at ambient 
temperature for 2 hours. The mixture was evaporated to dryness, 
azeotroped twice with ethyl acetate to give 

4- {4- [bis (2-chloroethyl) -amino J -3-methyl-phenoxy) -benzoic acid 
(compound 6) as a solid trifluoroacetate salt (1.0 g, 73% yield). 
Melting point, 91-3*^0. 

To a solution of N-Boc-L-Alanine-L-Glutamic acid dibenzyl 
ester (Beilharz et al^, 1983, 36, 751-8) (compound 7, 250 mg, 0.5 nM) 
in dichloromethane (2 mL) was added trifluoroacetic acid (4 mL). The 
solution was allowed to stand at ambient temperature for 1 hour and 
then evaporated to drjmess. The residue was re-dissolved in ethyl 
acetate, washed with saturated sodium bicarbonate solution, dried and 
evaporated to give an oil (compound 7 with BOC group removed). This 
oil in DMF (2 mL) was added to compound 6 (255 mg) in DHF (3 mL), 
followed by hydroxybenzotriazole (70 mg), 
l-(3-Dimethylaminopropyl)-3-ethylcarbodiimide hydrochloride 
(DECI; 115 mg) and triethylamine (0.18 mL). The mixture was stirred 
at ambient temperature for 1 hour, poured into saturated sodium 
bicarbonate solution (100 mL) and ethyl acetate (100 mL). The organic 
layer was separated, washed with water and then 0.5M citric acid, 
dried and evaporated to dryness. The residue was chromatographed with 
ethyl acetate/hexane (1:1) to give the desired starting material as an 
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oil (150 mg, UOX yield). 

NMR CDCI3 7.8 (d, 2H); 7.4-6.7 (m, 7H); 5.2 (s, 2H); 5.0 (s, 2H); 4.7 

(m, 2H); 3.4 (m, 8H); 2.3 (s, 3H); 2.45-2.0 (m, 4H); 1.4 (d, 3H). 



Example 22 
Preparation of 

N-(4- {4- [bis- (2-chloroet:hyl) -amino] -3-methyl-phenox7) -benzoyl)-L- 
alanine 

(see Figure 17 for reaction scheme) 

N-(4-{4-[bis-(2-chloroethyl)-amlnol-3-methyl-phenoxy)- 
benzoyl) -L-alanine- tert -butyl ester (400mg; compound 9) was dissolved 
in dichloromethane (4 ml) and trifluoroacetic acid (8 ml) was added. 
The mixture was allowed to stand at ambient temperature for 1 hour, 
evaporated to dryness and azeotroped twice with ethyl acetate to give 
the titled compound (compound 10; the drug corresponding with the 
prodrug of Example 21) as an oil (0.43g, 52% yield). 
NMR DMSOdg 8.5 (d, IH); 7.8-6.5 (ra, 7H); 4.4 (m, IH) ; 3.55 (t, 4H); 
3.35 (t, 4H)5 2.3 (s, 3H); 1.4 (d, 3H) . 
MS (M+H)"*", 439 

Starting material compound 9 was prepared as follows. 

To a mixture of 4-{4-(bis(2-chloroethyl)-amino]-3-methyl- 
phenoxy) -benzoic acid (compound 6 in example 21) (586mg, 1 romole) in 
dimethylformamide (2 mL) was added hydroxybenzotriazole (135 mg), 
l-(3-Dimethylaminopropyl)-3-ethylcarbodiimide hydrochloride (230 mg), 
then L-alanine- tert -butyl ester hydrochloride (181 mg) and 
triethylamine (0.54 mL). The mixture was stirred at ambient 
temperature for 2 hours, poured into saturated sodium bicarbonate 
solution (60 mL), extracted twice with ethyl acetate and the combined 
extracts washed with water, washed with IN citric acid solution, dried 
and evaporated to dryness. The residue was chromatographed with 
hexane/ethyl acetate (4:1) to give the desired starting material as an 
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oil (0.4g, 81% yield). 

NMR CDCI3 7.8 (d, 2H); 7.3-6.7 (m, 7H) ; 4.6 (m, IH); 3.4 (m, 8H); 2.3 
(s, 3H); 1.5 (d, 3H); 1.5 (s, 9H). 

Example 23 

Anti-tumour activity of prodrugs and humanised antibody-mutant HCPB 
fusion protein in xenografted mice. 

The anti-tumour efficacy of suitable prodrugs and humanised 
antibody-mutant HCPB fusion protein (Example 10) can be demonstrated 
in the following model. 

LoVo colorectal tumour cells (ECACC no. 87060101) (1 X lo') 
are injected s.c. into athymic nude mice. When the tumours are 4-5mm 
in diameter the conjugate is administered i.v. at doses between 10-100 
mg/kg. Following localisation of the fusion protein to the tumours and 
allowing a suitable time interval for residual conjugate to clear from 
the bloodstream and normal tissues (1-4 days) the prodrug is 
administered either i.v or i.p. to the mice in doses ranging between 
10-1000 mg/kg either as a single or multiple doses. The combination of 
antibody-enzyme fusion protein and prodrug cause the tumours to grow 
significantly slower than untreated control tumours or tumours treated 
with either the same dose of conjugate or prodrug alone. These studies 
demonstrate that the combination of the humanised antibody-mutant CPB 
fusion protein and the mutant CPB prodrugs result in significant 
anti- tumour activity. 

Example 24 
Preparation of 

N- [N- (4- {4- [bis- (2-chloroethyl) -amino] -phenoxy} -benzoyl )-L-alanine] -L- 
glutamic acid 

The titled compound was prepared in an analogous manner to 
that set out in Example 21 but substituting 4- (4-nitrophenoxy) -benzoic 
acid (Ravick et al. (1933), JACS, 55, 1289-1290) for 
4- (3-methyl-4-nitrophenoxy) -benzoic acid (compound 1 in Example 21). 
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NHRDMSOdg, 8.4 (d, IH) ; 8.3 (d, IH); 7.8 (d, IH); 7.05-6.75 (m, 6H)j 

4.5 (m, IH); 4.25 (in, IH); 3.7 (s, 8H); 2.4-1.6 (m, 4H); 1.4 (d, 3H). 
MS ESP, 551[M-H1" 

Example 25 
Preparation of 

N.(4>{4- [bis-(2-chloroethyl)-aminoJ -phenoxy) -benzoyl) -L-alanine 

The titled compound (which is the drug corresponding with 
the prodrug of Example 24) was prepared in an analogous manner to that 
set out in Example 22 but substituting 

4-{4-Ibis(2-chloroethyl)-amino]-phenoxy)-ben2oic acid (which is 
prepared as an intermediate in Example 24) for 

4-{4-Ibis(2-chloroethyl)-aminol-3-methyl-phenoxy)-benzoic acid. 
NMR CDCI3, 7.75 (d, 2H); 7.0-6.4 (m, 6H); 4.6 (m, IH); 3.6-3.4 
(m, 8H); 1.6 (d, 2H). 
MS ESP, 423 [M-HJ" 

Example 26 

Purification of D253R HCPB-HiSg-cNyc from E. coli 

The procedure described in Example 11 was repeated with 
plasmid pICI1713 replaced by pICI1746 (described in Example 2). 
Twelve 2L Erlenmeyer flasks containing 600ml of L-tetracydine were 
used for the fermentation in place of the nine flasks used in Example 
11. 

Recombinant E.coli cell paste was taken from storage at 
-70»C and allowed to thaw. The weight of cell paste was measured and 
found to be 82. 4g. The paste was resuspended with the addition of 
buffer A [200mM Tris-HCl (pH 8.0), 20% sucrose) to give a resuspended 
volume of 130ml. The cell suspension was incubated at room 
temperature for 20 minutes with occasional gentle mixing before an 
equal volume of distilled water, at room temperature, was added and 
thoroughly mixed in. The cell suspension was again incubated at room 
temperature for 20 minutes with occassional gentle mixing. The 
resulting crude osmotic shockate was clarified by centrifugation at 
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98000xg for 90 minutes at 4*>C after which the supernatant was decanted 
off from the pelleted insoluble fraction. The supernatant was diluted 
1:1 with lOmM Tris-HCl, 500mM sodium chloride, pH 8.0 (Buffer B), 
adjusted to pH 8.0 to a total volume of SOOral and loaded, over night, 
at 0.5ml/min, onto a Carboxypeptidase inhibitor affinity column. The 
column having been prepared according to instructions with the 
CNBr-activated Sepharose 4B (a preactivated 4% agarose gel for 
immobilisation of ligands containing primary amines; Pharmacia Cat. 
No. 17-0430-01) and carboxypeptidase inhibitor from potato tuber 
(c-0279, Sigma). The amount of matrix used for this size of 
carboxypeptidase purification was 15ml packed in a Pharmacia XK 16 
column. To produce a 15 ml quantity of matrix, 5g of dry matrix and 
80mg of Carboxypeptidase inhibitor were used. The procedure for 
preparing the affinity column was as follows: 

Freeze dried matrix (5g) was suspended in ImM HCl. The 
swollen gel was washed on a sintered glass filter with ImH HCl (11) 
added in several aliquots. The carboxypeptidase inhibitor (80mg) was 
dissolved in O.IM sodium bicarbonate (pH 8.3) containing 0.5M NaCl 
(coupling buffer; 25ml) then mixed with the gel in a stoppered vessel. 
The mixture was rotated end-over-end for Ih at room temperature or 4*»C 
for 18h then excess ligand was washed away with at least 5 gel volumes 
of coupling buffer. The gel was transferred to O.IM Tris-HCl (pH 8.0) 
and left to stand for 2h at room temperature, then washed with at 
least 5 gel volumes of O.IM acetate buffer (pH 4.0) containing 0.5M 
NaCl followed by at least 5 gel volumes of O.IM Tris-HCl (pH 8.0) 
containing 0.5M NaCl. 

The column was pre-equilibrated with buffer B at 4**C. After 
loading the supernatant, the column was washed until the absorbance of 
the flow through was back to baseline before the bound material was 
eluted from the column by elution buffer (lOOmM sodium carbonate, 
500mM sodium chloride, pH 11.4) at 4*^C, with 1ml fractions being 
collected. The eluted fractions were frozen at -20**C after samples 
were taken to determine those containing the recombinant 
carboxjrpeptidase B. This was accomplished by Western blot analysis 
using an anti-c-myc tag antibody (9E10), followed by an anti-mouse 
horse radish peroxidase conjugate (a-9044, Sigma) that gave a colour 
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reaction with exposure to 4-chloronaphthol and hydrogen peroxide, or 
by silver-stained polyacrylamide gel electrophoresis. Fractions 20 to 
66 were determined to contain the recombinant carboxypeptidase B. The 
diluted supernatant collected from the column during the initial 
loading was re-passaged over the column, after the column had been re 
equilibrated. Elution conditions and analysis of the fractions 
collected were identical to those from the first elution. Fractions 
25 to 60 were determined to contain the recombinant carboxypeptidase 
B. These were pooled with the fractions from the first passage, the 
pH adjusted to pH7.5 and concentrated using a Millipore Centrifugal 
Ultrafree -20 (a 10,000 molecular weight cut off filtration device) 
before being snap-frozen and stored at -SO'^C. The purification 
detailed here provided 3.5mg/ml of D253R mutant HCPB at a purity of 
87% in a volume of 550 microlitres. 

Example 27 

Purification of tG251N,D253KlHCPB-HiSg-cMyc from E- coli 

The procedure described in Example 26 was repeated with 
plasmid pICI1746 replaced by pMCAS.l (described in Example 13). The 
weight of cell paste was 94g which was resuspended in 110ml of buffer 
A. The volume of osmotic shockate, after dilution with buffer B, 
loaded onto the potator inhibitor column was 500ml. In the first 
elution, fractions 10 to 27 were collected. In the second passage 
fractions 10 to 39 were collected on elution. The purification 
provided 1.24mg/ml of [G251N,D253K]HCPB at a purity of 95% in a volume 
of 900 microlitres. 

Example 28 

Purification of [G251T,D253K]HCPB-HiSg-cMyc from E. coli 

The procedure described in Example 26 was repeated with 
plasmid pICI1746 replaced by pMC46.4.1 (described in Example 14). The 
weight of cell paste was 46g which was resuspended in 65ml of buffer 
A. The volume of osmotic shockate, after dilution with buffer B, 
loaded onto the potator inhibitor column was 260ml. In the first 
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elution, fractions 8 to 28 were collected. In the second passage 
fractions 41 to 79 were collected on elution. The purification 
provided 0.67mg/inl of [G251T,D253K]HCPB at a purity of 95X in a volume 
of 500 microlitres. 

Example 29 

Pharmaceutical composition 

The following illustrates a representative pharmaceutical 
dosage form of the present invention which may be used for therapeutic 
purpose in humans. 

Injectable solutions 

A sterile aqueous solution, for injection, containing per ml 



of solution: 

Antibody-enzyme of Example 10 l.Omg 

Sodium acetate trihydrate 6.8mg 

Sodium chloride 7.2mg 

Tween 20 O.OSmg 



A typical dose of conjugate is 30 mg followed 3 days later by prodrug. 

ii) Assemble the following for final prodrug dosage form 

preparation: glass vials (3 x 20ml) each containing 600mg of prodrug 
of Example 21; 3 ampoules containing 11ml of 2.15% (w/v) sodium 
hydrogen carbonate; needles (3 x 18G); hydrophobic filters for venting 
the vials; and 3 x single use sterile 0-22 micron filters for aqueous 
solutions. All materials must be stored at 2-8''C. 

These operations are preferably to be performed under 
sterile conditions. No more than 1 hour prior to dosing, one vial of 
prodrug is vented with a needle and hydrophobic filter. Sterile 2.15% 
w/v sodium hydrogen carbonate (10 ml) is then added directly through 
the bung via a syringe and needle. Vlth the vent still in place the 
vial is swirled gently to obtain a clear solution (this will be 50 
mg/ml as free base). The required dose volume is withdrawn into a 
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Sterile syringe through a sterile filter. The filter is then replaced 
by a sheathed sterile needle and the syringe unit kept cool prior to 
administration. Each remaining vial is prepared in an identical 
manner at intervals of one hour to allow for example three separate 
doses to be given 1 hour apart. 

Example 30 

Cloning, expression and purification of [G251S,D253K]HCPB-(His)g-c-Myc 
from E. coli 

The method of cloning and expressing the [G251S,D253K]HCPB 
in E.coli was very similar to the method described in Reference 
Example 8. The gene for [G251S,D253K]HCPB was prepared as described 
in Example 2 but the starting material for PGR site directed 
mutagenesis was the [D253K)HCPB gene in plasmid pICI1713 (described in 
Example 1) in place of pICI1712. However, in this case site directed 
mutagenesis was used during the PGR amplification of the gene to 
change the codon at amino acid position 251 in the mature gene from 
Glycine to Serine(GGC to TCT), the G251S change. Two PGR mixtures 
were prepared, in a manner similar to that described in Reference 
Examples 7 and 8 . In the first reaction primers were 2264 (SEQ ID NO: 
48) and 1038 (SEQ ID NO: 68, replacing SEQ ID NO: 61). In the second 
reaction primers were 6HIS9E10R1BS1 (SEQ ID NO: 42) and 54 (SEQ ID NO: 
75, replacing SEQ ID NO: 62). In both reactions the starting DNA was 
pICI1713. 

Methods of PGR, cloning, expression and identification were 
the same as for Example 14. From the sequencing results a clone 
containing a plasmid with the required IG251S,D253K]HGPB gene sequence 
was selected, and is known as pMC49.2. 

For purification of (G251S,D253K]HCPB, the procedure 
described in Example 26 was repeated with plasmid pICI1746 replaced by 
pMG49.2. The weight of cell paste was 77g which was resuspended in 
lOOral of buffer A. The volume of osmotic shockate, after dilution 
with buffer B, loaded onto the potator inhibitor column was 395ml. In 
the first elution, fractions 11 to 40 were collected. In the second 
passage fractions 12 to 33 were collected on elution. The 
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purification provided 1.7 mg/ml of IG251S,D253K]HCPB at a purity of 
86% in a volume of 500 microlitres. 

Example 31 

Cloning, expression and purification of 
[A248N,G251N,D253KIHCPB-(His)g-c-Myc from E. coll 

The method of cloning and expressing the 
[A248N,G251N,D253KJHCPB in E.coli was very similar to the method 
described in Reference Example B * The gene for 

[A248N,G251N,D253KIHCPB was prepared as described in Example 2 but the 
starting material for PCR site directed mutagenesis was the 
[G251N,D253KIHCPB gene in plasmid pMC43.1 (described in Example 13 ) in 
place of pICI1712. However, in this case site directed mutagenesis 
was used during the PCR amplification of the gene to change the codon 
at amino acid position 248 in the mature gene from Alanine to 
Asparagine (GCT to AAC), the G248N change. Two PCR mixtures were 
prepared, in a manner similar to that described in Reference Examples 
7 and 8 . In the first reaction primers were 2264 (SEQ ID NO: 48) and 
1024 (SEQ ID NO: 76, replacing SEQ ID NO: 61). In the second reaction 
primers were 6HIS9E10R1BS1 (SEQ ID NO: 42) and 1028 (SEQ ID NO: 77, 
replacing SEQ ID NO: 62). In both reactions the starting DNA was 
pHC43.1. 

Methods of PCR, cloning, expression and identification were 
the same as for Example 14. From the sequencing results a clone 
containing a plasmid with the required IA248N,G251N,D253K]HCPB gene 
sequence was selected , and is known as pHC50.2. 

For purification of [A248N,G251N,D253K]HCPB, the procedure 
described in Example 26 was repeated with plasmid pICI1746 replaced by 
pNC50.2. The weight of cell paste was 83g which was resuspended in 
100ml of buffer A. The volume of osmotic shockate, after dilution 
with buffer B, loaded onto the potator inhibitor column was 560ml. In 
the first elution, fractions 23 to 40 were collected. In the second 
passage fractions 18 to 40 were collected on elution. The 
purification provided 1.0 mg/ml of [A248N,G251N,D253KJHCPB at a purity 
of 90% in a volume of 1000 microlitres. 
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Example 32 

Cloning, expression and purification of 
IA248S,(S251H,D253KJHCPB-(His)g-c-Myc from E. coli 

The method of cloning and expressing the 
[A248S,G251N,D253K1HCPB in E.coli was very similar to the method 
described in Reference Example 8 > The gene for 

[A248S,G251N,D253K]HCPB was prepared as described in Example 2 but the 
starting material for PCR site directed mutagenesis was the 
[G251N,D253KJHCPB gene in plasmid pMC43.1 (described in Example 13 ) in 
place of pICI1712* However, in this case site directed mutagenesis 
was used during the PCR amplification of the gene to change the codon 
at amino acid position 248 in the mature gene from Alanine to Serine 
(GCT to TTC), the G248S change. Two PCR mixtures were prepared, in a 
manner similar to that described in Reference Examples 7 and 8 . In 
the first reaction primers were 2264 (SEQ ID NO: 48) and 1024 (SEQ ID 
NO: 76, replacing SEQ ID NO: 61). In the second reaction primers were 
6HIS9E10R1BS1 (SEQ ID NO: 42) and 1030 (SEQ ID NO: 78, replacing SEQ 
ID NO: 62). In both reactions the starting DNA was pMC43.1. 

Methods of PCR, cloning, expression and identification were 
the same as for Example 14. From the sequencing results a clone 
containing a plasmid with the required [A248S,G251N,D253KIHCPB gene 
sequence was selected, and is known as pHC51.2. 

For purification of IA248S,G251N,D253K]HCPB, the procedure 
described in Example 26 was repeated with plasmid pICI1746 replaced by 
pHC51.2. The weight of cell paste was 71 .Sg which was resuspended in 
lOOml of buffer A. The volume of osmotic shockate, after dilution 
with buffer B, loaded onto the potator inhibitor column was 520ml. In 
the first elution, fractions 10 to 40 were collected. In the second 
passage fractions 10 to 32 were collected on elution. In a third 
passage fractions 15 to 40 were collected on elution. The 
purification provided 0.8 mg/ml of [A248S,G251N,D253K]HCPB at a purity 
of 80.5% in a volume of 1500 microlitres. 



SIIRfiTITtlTC eUBCT/DIII B 9e\ 



wo 97/07769 



PCT/GB96/01975 



- 113 - 

Exaraple 33 

Cloning, expression and purification of 
[S205N,6251N,D253K]HCPB-(His)g-c-lfyc from E. coli 

The method of cloning and expressing the 
[S205N,G251N,D253KJHCPB in E.coll was very similar to the method 
described in Reference Example 8 > The gene for 

[S205N,G251N,D253K]HCPB vas prepared as described in Example 2 but the 
starting material for PCR site directed mutagenesis vas the 
[G251N,D253K]HCPB gene in plasmid pMC43-l (described in Example 13 ) in 
place of pICI1712. Hovever, in this case site directed mutagenesis 
vas used during the PCR amplification of the gene to change the codon 
at amino acid position 205 in the mature gene from Serine to 
Asparagine (TCA to AAC), the S205N change. Tvo PCR mixtures vere 
prepared y in a manner similar to that described in Reference Examples 
7 and 8 . In the first reaction primers vere 2264 (SEQ ID NO: 48) and 
1010 (SEQ ID NO: 79, replacing SEQ ID NO: 61). In the second reaction 
primers vere 6HIS9E10R1BS1 (SEQ ID NO: 42) and 1016 (SEQ ID NO: 80, 
replacing SEQ ID NO: 62). In both reactions the starting DNA vas 
PMC43.1. 

Methods of PCR, cloning, expression and identification vere 
the same as for Example 14. From the sequencing results a clone 
containing a plasmid vith the required [S205N,G251N,D253K]HCPB gene 
sequence vas selected, and is knovn as pMC52.1. 

For purification of [S205N,G251N,D253K]HCPB, the procedure 
described in Example 26 vas repeated vith plasmid pICI1746 replaced by 
pMC52.1« The veight of cell paste vas 77g vhich vas resuspended in 
100ml of buffer A. The volume of osmotic shockate, after dilution 
vith buffer B, loaded onto the potator inhibitor column vas 420ml. In 
the first elution, fractions 10 to 40 vere collected. In the second 
passage fractions 12 to 29 vere collected on elution. The 
purification provided 0.8 mg/ml of IS205N,G251N,D253K]HCPB at a purity 
of 85% in a volume of 650 microlltres. 
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Example 34 

Cloning and expression of [G251T,D253R]HCPB-(His)g-c-H7c from E. coli 

The method of cloning and expressing the [G251T,D253RIHCPB 
in E.coli was very similar to the method described in Reference 
Example 8 . The gene for IG251T,D253R]HCPB was prepared as described 
in Example 2 but the starting material for PGR site directed 
mutagenesis was the [D253RJHCPB gene in plasmid pICI1746 (described in 
Example 2 ) in place of pICI1712* However, in this case site directed 
mutagenesis was used during the PGR amplification of the gene to 
change the codon at amino acid position 251 in the mature gene from 
Glycine to Threonine (GGC to ACT), the G251T change. Two PGR mixtures 
were prepared, in a manner similar to that described in Reference 
Examples 7 and 8 . In the first reaction primers were 2264 (SEQ ID NO: 
48) and 1038 (SEQ ID NO: 68, replacing SEQ ID NO: 61). In the second 
reaction primers were 6HIS9E10R1BS1 (SEQ ID NO: 42) and 794 (SEQ ID 
NO: 81, replacing SEQ ID NO: 62). In both reactions the starting DNA 
was PICI1746. 

Methods of PGR, cloning, expression and identification were 
the same as for Example 14. From the sequencing results a clone 
containing a plasmid with the required [G251T,D253R]HCPB gene sequence 
was selected, and is known as pNC55.2. 

Example 35 

Cloning and expression of [I201T,D253RIHCPB-(His)g-c*H7c from E. coli 

The method of cloning and expressing the [I201T,D253R]HGPB 
in E.coli was very similar to the method described in Reference 
Example 8 . The gene for [I201T,D253R]HGPB was prepared as described 
in Example 2 but the starting material for PGR site directed 
mutagenesis was the [D253RJHCPB gene in plasmid pIGI1746 (described in 
Example 2 ) in place of pIGI1712. However, in this case site directed 
mutagenesis was used during the PGR amplification of the gene to 
change the codon at amino acid position 201 in the mature gene from 
Isoleucine to Threonine (ATG to ACT), the I201T change. Two PGR 
mixtures were prepared, in a manner similar to that described in 
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Reference Examples 7 and 8. in the first reaction primers were 2264 
(SEQ ID NO: 48) and 1003 (SEQ ID NO: 82, replacing SEQ ID NO: 61). In 
the second reaction primers were 6HIS9E10R1BS1 (SEQ ID NO: 42) and 795 
(SEQ ID NO: 83, replacing SEQ ID NO: 62). In both reactions the 
starting DNA was pICI1746. 

Methods of PCR, cloning, expression and identification were 
the same as for Example 14. From the sequencing results a done 
containing a plasmid with the required [I201T,D253R]HCPB gene sequence 
was selected, and is known as pMC57.2. 

Example 36 

Cloning and expression of [A248N,G251T,D253KlHCPB-(His)g-c-ll7c from 
E. cell 



The method of cloning and expressing the 
IA248N,G251T,D253KJHCPB in E.coli was very similar to the method 
described in Reference Example 8 . The gene for 

[A248N,G251T,D253KJHCPB was prepared as described in Example 2 but the 
starting material for PCR site directed mutagenesis was the 
IG251T,D253KJHCPB gene in plasmid pMC46.4 (described in Example 14 ^ in 
place of plCI1712. However, in this case site directed mutagenesis 
was used during the PCR amplification of the gene to change the codon 
at amino acid position 248 in the mature gene from Alanine to 
Asparagine (GCT to AAC), the G248N change. Two PCR mixtures were 
prepared, in a manner similar to that described in Reference Examples 
7_Md_8. In the first reaction primers were 2264 (SEQ ID NO: 48) and 
1024 (SEQ ID NO: 76, replacing SEQ ID NO: 61). In the second reaction 
primers were 6HIS9E10R1BS1 (SEQ ID NO: 42) and 1028 (SEQ ID NO: 77, 
replacing SEQ ID NO: 62). In both reactions the starting DNA was 
pHC46.4. 

Methods of PCR, cloning, expression and identification were 
the same as for Example 14. Prom the sequencing results a clone 
containing a plasmid with the required (A248N,G251T,D253K]HCPB gene 
sequence was selected, and is known as pHC59.3. 
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Example 37 

Cloning and expression of [A248S,G251T,D253KlHCPB-(His)g-c-Myc froa E. 
coli 

The method of cloning and expressing the 
(A248S,G251T,D253K]HCPB in E.coli was very similar to the method 
described in Reference Example 8 . The gene for 

[A248S,G251T,D253KJHCPB was prepared as described in Example 2 but the 
starting material for PCR site directed mutagenesis was the 
IG251T,D253K]HCPB gene in plasmid pMC46.4 (described in Example 14 ^ in 
place of pICI1712, However, in this case site directed mutagenesis 
was used during the PCR amplification of the gene to change the codon 
at amino acid position 248 in the mature gene from Alanine to Serine 
(GCT to TTC), the G248S change. Two PCR mixtures were prepared, in a 
manner similar to that described in Reference Examples 7 and 8 > In 
the first reaction primers were 2264 (SEQ ID NO: 48) and 1024 (SEQ ID 
NO: 76, replacing SEQ ID NO: 61). In the second reaction primers were 
6HIS9E10R1BS1 (SEQ ID NO: 42) and 1030 (SEQ ID NO: 78, replacing 
SEQ ID NO: 62). In both reactions the starting DNA was pMC46.4. 

Methods of PCR, cloning, expression and identification were 
the same as for Example 14. Prom the sequencing results a clone 
containing a plasmid with the required IA248S,G251T,D253K]HCPB gene 
sequence was selected, and is known as pHC60.3. 

Example 38 

Preparation of humanised A5B7 F(ab')2-IG251T,D253KIHCPB fusion protein 

The procedure described in Example 10 is repeated but with 
the sequence for D253K HCPB replaced by the sequence for 
[G251T,D253K]HCPB. The sequence for [G251T,D253K]HCPB is described in 
Example 14. 
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Example 39 

Purification and enzyme activity of [A248S,G251T,D253K]HCPB- 
(His)g-c-*]fyc 

The procedure described in Example 3 vas repeated with HSD 
2230 replaced by MSD 2812 (BSD 1924 pZEN1921). Plasmid pZEN1921 Is 
also known as pMC60.3, Preparation of plasrald pMC60.3 Is described In 
Example 37. Two samples of cell paste were processed separately. The 
weight of cell paste taken from storage was 593g in the first sample 
and 558g in the second sample. These were suspended in buffer A 
(750ml and 710ml respectively) to prepare an osmotic shockate. The 
volume of osmotic shockate, after dilution with buffer B, loaded onto 
two potator inhibitor columns in series was 31 for each sample. In 
the first purification, the fractions pooled from the two columns were 
18 to 80 (column 1) and 30 to 63 (column 2). In the second 
purification, the fractions pooled from the two columns were 21 to 73 
(column 1) and 23 to 68 (column 2). The combined purification 
provided 2.6 mg/ml of [A248S,G251T,D253K]HCPB at a purity of 83Z in a 
volume of 4.4ml. 



Enzyme activity against Hipp-Glu, Hipp-Asp and Hipp-Arg 
substrates was determined as described in Example 17. 

Concentration Ripp-Glu Hipp-Asp Hipp-Arg 
(pg/ml) {% conversion) 

25 93 5.5 0 

0.25 14.5 0 0 

Determination of Km and kcat against Hipp-Glu was as 
described in Example 7. 



Km (mN) 1.1 
kcat (s'^) 19 
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Example 40 

Purification and enzyne activity of [G251T,D253R]HCPB-(His)g-c-M7c 

The procedure described in Example 3 vas repeated with MSD 
2230 replaced by MSD 2803 (MSD 1924 pZEN1907). Plasmid pZEN1907 is 
also known as pMC55*2. Preparation of plasmid pMC55.2 is described in 
Example 34. Two samples of cell paste were processed separately. The 
weight of cell paste taken from storage was 660g in the first sample 
and 484g in the second sample. These were suspended in buffer A 
(800ml and 700ml respectively) to prepare an osmotic shockate. The 
volume of osmotic shockate, after dilution with buffer B, loaded onto 
two potator inhibitor columns in series was 1.61 and 1.41 
respectively. In the first purification, the fractions pooled from 
the two columns were 20 to 70 (column 1) and 20 to 65 (column 2). In 
the second purification, a single fraction was collected from both 
columns corresponding to the elution peak profile given on the chart 
recorder. The combined purification provided 3.6 mg/ml of 
[G251T,D253RIHCPB at a purity of 50% in a volume of 1.4ml. 

Enzyme activity against Hipp-Glu, Hipp-Asp and Hipp-Arg 
substrates was determined as described in Example 17. 

Concentration Hipp-Glu Hipp-Asp Hipp-Arg 
(yg/ml) (X conversion) 

25 96 29 0 

1 16.2 1.2 0 

Determination of Km and kcat against Hipp-Glu and Hipp-Asp 
was as described in Example 7. 
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Hlpp-Glu Hipp-Asp 
Km (ndf) 1.6 1.7 

kcat (s"^) 7.8 0.7 

kcat/Bn (inM"^s"^) 4.9 0.4 



ES70099 

AFG/HB : 30J0L96 
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Formula 1 
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Formula 3 



^(x)-< m) 



PCT/GB9«/01975 



Formula 4 



COJP.) 



Formulas 



r6r3 

N— T V— OCOHNt— i-C02H 
r5\r4 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: ZENECA LIMITED 

(B) STREET: 15 STANHOPE GATE 

(C) CITY: LONDON 

(E) COUNTRY: UNITED KINGDOM 

(F) POSTAL CODE (ZIP) : WIY 6LN 

(G) TELEPHONE: 0171 304 5000 

(H) TELEFAX: 0171 304 5151 

(I) TELEX: 0171 634 2042 

(ii) TITLE OF INVENTION: chemical compounds 
(iii) NUMBER OF SEQUENCES: 74 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
CTCTAGGAAT TCTTATTAGT ACAGGTGTTC CAGGACGTAG C 
(2) INFORMATION FOR SEQ ID NO: 2; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CCCAAGCTTG CCGCCACCAT GTTGGCAGTC TTGGTTCTGG TGACTGTG6C CCTGGCATCT 60 
GCTGCAACAG GACACAGTTA TGAGAAGTAC AACAAGTGGG AAACGATA 108 



fitincTiTiiTc euBe^/Dtii c oe\ 



wo 97/07769 



-123- 



PCT/GB96/01975 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AACA6CTATG ACCATG 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GTAAAACGAC GGCCAGT 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCJRIPTION: SEQ ID NO: 5: 
TCGCTATTAC CATGGTGATG CGGTTTTGGC 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CAGACTCTGC AGCAGGTCCA CAG 23 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CCCAAGCTTG CCGCCACCAT GTTGGCACTC TTGGTTCTGG TGACTGTGGC CCTG 54 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CTCATAACT6 AATTCTTATT AACCSAACCCG GCTATCAAA 39 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGATCTGCTG CCCAAGCTTA CTCCATGGTG ACCC 34 
(2) INFORMATION FOR SEQ ID NO: 10: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTTCTCATAA CTGTGTCCTG TTGCGAACAC GCT6CTCACC TCGGGCACTG TACATATGCA 
AGGCTTACAA CCACAATCCC 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGTTGTAAGC CTTGCATATG TACAGTGCCC GAGGTGAGCA GCGTGTTCGC AACAGGACAC 
AGTTATGAGA AGTACAAC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CCGTTTGATC TCGAGCTTGG TGCCTCC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ATATAAAGCT TGCCGCCACC ATGAAGTTGT GGCTGAACTG GATTTTCCTT 50 



(2) INFORMATION FOR SEQ ID NO: 14: 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ATCGAATTCG CCGCCACCAT GGATTTTCAA GTGCAGATTT TCAGCTTC 48 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TGAGAATTCT TACTATGTAC ATATGCAAGG CTTACAACCA CAATC 45 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GCGCCGAATT CTTATTAACA CTCATTCCTG TTGAA 35 
(2) INFORMATION FOR SEQ ID NO: 17: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQXJENCE DESCRIPTION: SEQ ID NO: 17: 
GACCTGGAAC TCTGGATCTC TGTCCAGCGG 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AGGTGTGCAC ACCGCTGGAC AGAGATCCAG 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TGGTACCAGC AGAAGCCAGG TTCCTCCCCC 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 16. .765 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

AAGCTTGCCG CCACC ATG AAG TTG TGG CTG AAC TGG ATT TTC CTT GTA ACA 51 
Met Lys Leu Trp Leu Asn Trp He Phe Leu Val Thr 
15 10 

CTT TTA AAT GGT ATC CAG TGT GAG GT6 AAG CTG GTG GAG TCT GGA GGA 99 
Leu Leu Asn Gly He Gin Cys Glu Val Lye Leu Val Glu Ser Gly Gly 
15 20 25 

GGC TTG GTA CAG CCT GGG GGT TCT CTG AGA CTC TCC TGT GCA ACT TCT 147 
Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Thr Ser 
30 35 40 

GGG TTC ACC TTC ACT GAT TAC TAC ATG AAC TGG GTC CGC CAG CCT CCA 195 
Gly Phe Thr Phe Thr Asp Tyr Tyr Met Asn Trp Val Arg Gin Pro Pro 
45 50 55 60 

GGA AAG GCA CTT GAG TGG TTG GGT TTT ATT GGA AAC AAA GCT AAT GGT 243 
Gly Lys Ala Leu Glu Trp Leu Gly Phe He Gly Asn Lys Ala Asn Gly 
65 70 75 

TAC ACA ACA GAG TAC AGT GCA TCT GTG AAG GGT CGG TTC ACC ATC TCC 291 
Tyr Thr Thr Glu Tyr Ser Ala Ser Val Lys Gly Arg Phe Thr He Ser 
80 85 90 

AGA GAC AAA TCC CAA AGC ATC CTC TAT CTT CAA ATG AAC ACC CTG AGA 339 
Arg Asp Lys Ser Gin Ser He Leu Tyr Leu Gin Met Asn Thr Leu Arg 
95 100 105 

GCT GAG GAC AGT GCC ACT TAT TAC TGT ACA AGA GAT AGG GGG CTA CGG 387 
Ala Glu Asp Ser Ala Thr Tyr Tyr Cys Thr Arg Asp Arg Gly Leu Arg 
110 115 120 

TTC TAC TTT GAC TAC TGG GGC CAA GGC ACC ACT CTC ACA GTC TCC TCA 435 
Phe Tyr Phe Asp Tyr Trp Gly Gin Gly Thr Thr Leu Thr Val Ser Ser 
125 130 135 140 

GCC AAA ACG ACA CCC CCA TCT GTC TAT CCA CTG GCC CCT GGA TCT GCT 483 
Ala Lys Thr Thr Pro Pro Ser Val Tyr Pro Leu Ala Pro Gly Ser Ala 
145 150 155 

GCC CAA ACT AAC TCC ATG GTG ACC CTG GGA TGC CTG GTC AAG GGC TAT 531 
Ala Gin Thr Asn Ser Met Val Thr Leu Gly Cys Leu Val Lys Gly Tyr 
160 165 170 

TTC CCT GAG CCA GTG ACA GTG ACC TGG AAC TCT GGA TCT CTG TCC AGC 579 
Phe Pro Glu Pro Val Thr Val Thr Trp Asn Ser Gly Ser Leu Ser Ser 
175 180 185 

GGT GTG CAC ACC TTC CCA GCT GTC CTG CAG TCT GAC CTC TAC ACT CTG 627 
Gly Val His Thr Phe Pro Ala Val Leu Gin Ser Asp Leu Tyr Thr Leu 
190 195 200 
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AGC AGC TCA GTG ACT GTC CCC TCC AGC ACC TGG CCC AGC GAG ACC GTC 675 
Ser Ser Ser Val Thr Val Pro Ser Ser Thr Trp Pro Ser Glu Thr Val 
205 210 215 220 

ACC TGC AAC GTT GCC CAC CCG GCC AGC AGC ACC AAG GTG GAC AAG AAA 723 
Thr Cys Asn Val Ala His Pro Ala Ser Ser Thr Lys Val Asp Lys Lys 
225 230 235 

ATT GTG CCC AGG GAT TGT GGT TGT AAG CCT TGC ATA TGT ACA 765 
He Val Pro Arg Asp Cys Gly Cys Lys Pro Cys He Cys Thr 
240 245 250 



TAGTAAGAAT TC 



777 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met Lys Leu Trp Leu Asn Trp He Phe Leu Val Thr Leu Leu Asn Gly 
15 10 15 

He Gin Cys Glu Val Lys Leu Val Glu Ser Gly Gly Gly Leu Val Gin 
20 25 30 

Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Thr Ser Gly Phe Thr Phe 
35 40 45 

Thr Asp Tyr Tyr Met Asn Trp Val Arg Gin Pro Pro Gly Lys Ala Leu 
50 55 60 

Glu Trp Leu Gly Phe He Gly Asn Lys Ala Asn Gly Tyr Thr Thr Glu 
€5 70 75 80 

Tyr Ser Ala Ser Val Lys Gly Arg Phe Thr He Ser Arg Asp Lys Ser 
85 90 95 

Gin Ser He Leu Tyr Leu Gin Met Asn Thr Leu Arg Ala Glu Asp Ser 
100 105 110 

Ala Thr Tyr Tyr Cys Thr Arg Asp Arg Gly Leu Arg Phe Tyr Phe Asp 
115 120 125 

Tyr Trp Gly Gin Gly Thr Thr Leu Thr Val Ser Ser Ala Lys Thr Thr 
130 135 140 

Pro Pro Ser Val Tyr Pro Leu Ala Pro Gly Ser Ala Ala Gin Thr Asn 
145 150 155 160 

Ser Met Val Thr Leu Gly Cys Leu Val Lys Gly Tyr Phe Pro Glu Pro 
165 170 175 
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Val Thr Val Thr Trp Asn Ser Gly 
180 

Phe Pro Ala Val Leu Gin Ser Asp 
195 200 

Thr Val Pro Ser Ser Thr Trp Pro 
210 215 

Ala His Pro Ala Ser Ser Thr Lys 
225 230 



Ser Leu Ser Ser Gly Val His Thr 
165 190 

Leu Tyr Thr Leu Ser Ser Ser Val 
205 

Ser Glu Thr Val Thr Cys Asn Val 
220 

Val Asp Lys Lys He Val Pro Arg 
235 240 



Asp Cys Gly Cys Lys Pro Cys He Cys Thr 
245 250 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 16. .720 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GAATTCGCCG CCACC ATG GAT TTT CAA GTG CAG ATT TTC AGC TTC CTG CTA 51 
Met Asp Phe Gin Val Gin He Phe Ser Phe Leu Leu 
1 5 10 

ATC AGT GCT TCA GTC ATA ATG TCC AGA GGA CAA ACT GTT CTC TCC CAG 99 
He Ser Ala Ser Val He Met Ser Arg Gly Gin Thr Val Leu Ser Gin 
15 20 25 

TCT CCA GCA ATC CTG TCT GCA TCT CCA GGG GAG AAG GTC ACA ATG ACT 147 
Ser Pro Ala He Leu Ser Ala Ser Pro Gly Glu Lys Val Thr Met Thr 
30 35 40 

TGC AGG GCC AGC TCA AGT GTA ACT TAC ATT CAC TGG TAC CAG CAG AAG 195 
Cys Arg Ala Ser Ser Ser Val Thr Tyr He His Trp Tyr Gin Gin Lys 
45 50 55 60 

CCA GGT TCC TCC CCC AAA TCC TGG ATT TAT GCC ACA TCC AAC CTG GCT 243 
Pro Gly Ser Ser Pro Lys Ser Trp He Tyr Ala Thr Ser Asn Leu Ala 
65 70 75 



TCT GGA GTC CCT GCT CGC TTC AGT GGC AGT GGG TCT GGG ACC TCT TAC 
Ser Gly Val Pro Ala Arg Phe Ser Gly Ser Gly Ser Gly Thr Ser Tyr 
80 85 90 



291 
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TCT CTC ACA ATC AQC AGA GTG GAG GCT GAA GAT GCT GCC ACT TAT TAC 
Ser Leu Thr He Ser Arg Val Glu Ala Glu Asp Ala Ala Thr Tyr Tyr 
95 100 105 



339 



TGC CAA CAT TGG ACT AGT AAA CCA CCG ACG TTC GGT GGA GGC ACC AAG 
Cys Gin His Trp Ser Ser Lys Pro Pro Thr Phe Gly Gly Gly Thr Lys 
110 115 120 



387 



CTG GAA ATC AAA CGG GCT GAT GCT GCA CCA ACT GTA TCC ATC TTC CCA 
Leu Glu He Lys Arg Ala Asp Ala Ala Pro Thr Val Ser He Phe Pro 
125 130 135 140 



435 



CCA TCC AGT GAG CAG TTA ACA TCT GGA GGT GCC TCA GTC GTG TGC TTC 
Pro Ser Ser Glu Gin Leu Thr Ser Gly Gly Ala Ser Val Val Cys Phe 
145 150 155 



483 



TTG AAC AAC TTC TAC CCC AAA GAC ATC AAT GTC AAG TGG AAG ATT GAT 
Leu Asn Asn Phe Tyr Pro Lys Asp He Asn Val Lys Trp Lys He Asp 
160 165 170 



531 



GGC AGT GAA CGA CAA AAT GGC GTC CTG AAC AGT TGG ACT GAT CAG GAC 579 
Gly Ser Glu Arg Gin Asn Gly Val Leu Asn Ser Trp Thr Asp Gin Asp 
175 180 185 

AGC AAA GAC AGC ACC TAC AGC ATG AGC AGC ACC CTC ACG TTG ACC AAG 627 
Ser Lys Asp Ser Thr Tyr Ser Met Ser Ser Thr Leu Thr Leu Thr Lys 
190 195 200 

GAC GAG TAT GAA CGA CAT AAC AGC TAT ACC TGT GAG GCC ACT CAC AAG 675 
Asp Glu Tyr Glu Arg His Asn Ser Tyr Thr Cys Glu Ala Thr His Lys 
205 210 215 220 

ACA TCA ACT TCA CCC ATT GTC AAG AGC TTC AAC AGG AAT GAG TGT 720 
Thr Ser Thr Ser Pro He Val Lys Ser Phe Asn Arg Asn Glu Cys 
225 230 235 

TAATAAGAAT TC 732 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Asp Phe Gin Val Gin He Phe Ser Phe Leu Leu He Ser Ala Ser 
15 10 15 

Val He Met Ser Arg Gly Gin Thr Val Leu Ser Gin Ser Pro Ala He 
20 25 30 

Leu Ser Ala Ser Pro Gly Glu Lys Val Thr Met Thr Cys Arg Ala Ser 
35 40 45 
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Ser Ser Val Thr Tyr He His Trp Tyr Gin Gin Lys Pro Gly Ser Ser 
50 55 60 

Pro Lys Ser Trp He Tyr Ala Thr Ser Asn Leu Ala Ser Gly Val Pro 
^5 70 75 80 

Ala Arg Phe Ser Gly Ser Gly Ser Gly Thr Ser Tyr Ser Leu Thr He 
85 90 95 

Ser Arg Val Glu Ala Glu Asp Ala Ala Thr Tyr Tyr Cys Gin His Trp 
100 105 110 

Ser Ser Lys Pro Pro Thr Phe Gly Gly Gly Thr Lys Leu Glu He Lys 
115 120 125 

Arg Ala Asp Ala Ala Pro Thr Val Ser He Phe Pro Pro Ser Ser Glu 
130 135 140 

Gin Leu Thr Ser Gly Gly Ala Ser Val Val Cys Phe Leu Asn Asn Phe 
150 155 160 



Tyr Pro Lys Asp He Asn Val Lys Trp Lys He Asp Gly Ser Glu Arg 
1€5 170 175 



Gin Asn Gly Val Leu Asn Ser Trp Thr Asp Gin Asp Ser Lys Asp Ser 
180 185 190 

Thr Tyr Ser Met Ser Ser Thr Leu Thr Leu Thr Lys Asp Glu Tyr Glu 
135 200 205 

Arg His Asn Ser Tyr Thr Cys Glu Ala Thr His Lys Thr Ser Thr Ser 
210 215 220 

Pro He Val Lys Ser Phe Asn Arg Asn Glu Cys 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 16. .765 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

AAGCTTGCCG CCACC ATG AAG TTG TGG CTG AAC TGG ATT TTC CTT GTA ACA 51 
Met Lys Leu Trp Leu Asn Trp He Phe Leu Val Thr 
15 10 

CTT TTA AAT GGT ATC CAG TGT GAG GTG CAG CTG CTG GAG TCT GGA GGA 99 
Leu Leu Asn Gly He Gin Cys Glu Val Gin Leu Leu Glu Ser Gly Gly 
15 20 25 
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GGA CTG GTG CAG CCT GGA GGA TCT CTG AGA CTG TOT TGT GCA ACA TCT 
Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Thr Ser 
30 35 40 



147 



GGA TTC ACC TTC ACA GAC TAG TAG ATG AAT TGG GTG AGA CAG GCA CCT 
Gly Phe Thr Phe Thr Asp Tyr Tyr Met Asn Trp Val Arg Gin Ala Pro 
45 50 55 60 



195 



GGA AAG GGA CTC GAG TGG CTG GGC TTC ATC GGA AAT AAG GCA AAT GGA 
Gly Lys Gly Leu Glu Trp Leu Gly Phe lie Gly Asn Lys Ala Asn Gly 
€5 70 75 



243 



TAC ACA ACA GAG TAC TCT GCA TCT GTG AAG GGA AGA TTC ACA ATT TCC 
Tyr Thr Thr Glu Tyr Ser Ala Ser Val Lys Gly Arg Phe Thr lie Ser 
80 85 90 



291 



AGA GAC AAG AGC AAG TCC ACA CTG TAC CTG CAG ATG AAT ACA CTG CAG 
Arg Asp Lys Ser Lys Ser Thr Leu Tyr Leu Gin Met Asn Thr Leu Gin 
95 100 105 



339 



GCA GAG GAC TCT GCA ATT TAC TAC TGT ACA AGA GAC AGA GGA CTG AGA 
Ala Glu Asp Ser Ala He Tyr Tyr Cys Thr Arg Asp Arg Gly Leu Arg 
110 115 120 



387 



TTC TAC TTC GAC TAC TGG GGA CAG GGA ACA CTG GTG ACA GTG TCT TCT 435 
Phe Tyr Phe Asp Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
125 130 135 140 

GCT AGC ACC AAG GGA CCA TCG GTC TTC CCC CTG GCC CCC TGC TCC AGG 483 
Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg 
145 150 155 

AGC ACC TCC GAG AGC ACA GCC GCC CTG GGC TGC CTG GTC AAG GAC TAT 531 
Ser Thr Ser Glu Ser Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr 
160 165 170 

TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA GGC GCT CTG ACC AGC 579 
Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser 
175 180 185 

GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA CTC TAC TCC 627 
Gly Val His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser 
190 195 200 

CTC AGC AGC GTC GTG ACG GTG CCC TCC AGC AAC TTC GGC ACC CAG ACC €75 
Leu Ser Ser Val Val Thr Val Pro Ser Ser Asn Phe Gly Thr Gin Thr 
205 210 215 220 

TAC ACC TGC AAC GTA GAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG 723 
Tyr Thr Cys Asn Val Asp His Lys Pro Ser Asn Thr Lys Val Asp Lys 
225 230 235 

ACA GTT GAG CGC AAA TGT TGT GTC GAG TGC CCA CCG TGC CCG 765 
Thr Val Glu Arg Lys Cys Cys Val Glu Cys Pro Pro Cys Pro 
240 245 250 

TAATAGGAAT TC 777 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Lys Leu Trp Leu Asn Trp He Phe Leu Val Thr Leu Leu Asn Gly 
15 10 15 

He Gin Cys Glu Val Gin Leu Leu Glu Ser Gly Gly Gly Leu Val Gin 
20 25 30 

Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Thr Ser Gly Phe Thr Phe 
35 40 45 

Thr Asp Tyr Tyr Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu 

55 60 

Glu Trp Leu Gly Phe He Gly Asn Lys Ala Asn Gly Tyr Thr Thr Glu 
70 75 80 

Tyr Ser Ala Ser Val Lys Gly Arg Phe Thr He Ser Arg Asp Lys Ser 
85 90 95 

Lys Ser Thr Leu Tyr Leu Gin Met Asn Thr Leu Gin Ala Glu Asp Ser 
100 105 110 

Ala He Tyr Tyr Cys Thr Arg Asp Arg Gly Leu Arg Phe Tyr Phe Asp 
115 120 125 

Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys 
130 135 140 

Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg Ser Thr Ser Glu 
"5 150 155 160 

Ser Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro 
165 170 175 

Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr 
180 185 190 

Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val 
195 200 205 

Val Thr Val Pro Ser Ser Asn Phe Gly Thr Gin Thr Tyr Thr Cys Asn 
210 215 220 

Val Asp His Lys Pro Ser Asn Thr Lys Val Asp Lys Thr Val Glu Arg 
225 230 235 240 

Lys Cys Cys Val Glu Cys Pro Pro Cys Pro 
245 250 

(2) INFORMATION FOR SEQ ID NO: 26: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 16. .720 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GAATTCGCCG CCACC ATG GAT TTT CAA GTG CAG ATT TTC AGC TTC CTG CTA 51 
Met Asp Phe Gin Val Gin He Phe Ser Phe Leu Leu 
15 10 

ATC AGT GCT TCA GTC ATA ATG TCC AGA GGA CAG ACT GTA CTC ACT CAG 99 
He Ser Ala Ser Val He Met Ser Arg Gly Gin Thr Val Leu Thr Gin 
15 20 25 

AGT CCA AGT AGT CTC AGT GTA AGT GTA GGT GAT AGG GTA ACT ATG ACT 147 
Ser Pro Ser Ser Leu Ser Val Ser Val Gly Asp Arg Val Thr Met Thr 
30 35 40 

TGT AGG 6CC AGT AGT AGT GTA ACT TAT ATC CAT TGG TAT CAG CAG AAA 195 
Cys Arg Ala Ser Ser Ser Val Thr Tyr He His Trp Tyr Gin Gin Lys 
45 50 55 60 

CCA GGT CTC GCC CCA AAA AGT TGG ATC TAT GCC ACT AGT AAC CTC GCC 243 
Pro Gly Leu Ala Pro Lys Ser Trp He Tyr Ala Thr Ser Asn Leu Ala 
65 70 75 



AGT GGT GTA CCA TCT AGA TTC AGT GGT AGC GGT AGT GGT ACT GAT TAT 
Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Tyr 
80 85 90 



291 



ACT CTC ACT ATC AGT AGT CTC CAG CCA GAA GAT ATC GCC ACT TAC TAT 
Thr Leu Thr He Ser Ser Leu Gin Pro Glu Asp He Ala Thr Tyr Tyr 
95 100 105 



339 



TGC CAG CAT TGG AGT AGT AAA CCA CCA ACT TTC GGT CAG GGT ACT AAA 
Cys Gin His Trp Ser Ser Lys Pro Pro Thr Phe Gly Gin Gly Thr Lys 
110 115 120 



387 



GTA GAA GTA AAA CGT ACT GTG GCT GCA CCA TCT GTC TTC ATC TTC CCG 
Val Glu Val Lys Arg Thr Val Ala Ala Pro Ser Val Phe He Phe Pro 
125 130 135 140 



435 



CCA TCT GAT GAG CAG TTG AAA TCT GGA ACT GCC TCT GTT GTG TGC CTG 
Pro Ser Asp Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu 
145 150 155 



4B3 



CTG AAT AAC TTC TAT CCC AGA GAG GCC AAA GTA CAG TGG AAG GTG GAT 
Leu Asn Asn Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp 
160 165 170 



531 
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AAC GCC CTC CAA TCXJ GGT AAC TCC CAG GAG AGT GTC ACA GAG CAG GAC 579 
Aen Ala Leu Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp 
175 180 185 

AGC AAG GAC AGC ACC TAG AGC CTC AGC AGC ACC CTG ACG CTG AGC AAA 627 
Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys 
190 195 200 

GCA GAC TAC GAG AAA CAC AAA GTC TAC GCC TGC GAA GTC ACC CAT CAG 675 
Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gin 
205 210 215 220 

GGC CTG AGT TCG CCC GTC ACA AAG AGC TTC AAC AGG GGA GAG TGT 720 
Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys 
225 230 235 

TAATAGGAAT TC 732 



(2) INFORMATION FOR SEQ ID NO: 27: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Met Asp Phe Gin Val Gin He Phe Ser Phe Leu Leu He Ser Ala Ser 
15 10 15 

Val He Met Ser Arg Gly Gin Thr Val Leu Thr Gin Ser Pro Ser Ser 
20 25 30 

Leu Ser Val Ser Val Gly Asp Arg Val Thr Met Thr Cys Arg Ala Ser 
35 40 45 

Ser Ser Val Thr Tyr He His Trp Tyr Gin Gin Lys Pro Gly Leu Ala 
50 55 60 

Pro Lys Ser Trp He Tyr Ala Thr Ser Asn Leu Ala Ser Gly Val Pro 
65 70 75 80 

Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Tyr Thr Leu Thr He 
85 90 95 

Ser Ser Leu Gin Pro Glu Asp He Ala Thr Tyr Tyr Cys Gin His Trp 
100 105 110 

Ser Ser Lys Pro Pro Thr Phe Gly Gin Gly Thr Lys Val Glu Val Lys 
115 120 125 

Arg Thr Val Ala Ala Pro Ser Val Phe He Phe Pro Pro Ser Asp Glu 
130 135 140 

Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe 
145 150 155 160 

Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn Ala Leu Gin 
165 170 175 
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Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp Ser Lye Asp Ser 
180 185 190 

Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu 
195 200 205 

Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser Ser 
210 215 220 

Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28; 
GTTGGAGCTC TTGGTTCTGG 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CAAGGCCTCG AGCTTTCTCA AC 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GTTTGATTCT AGAGTTCGTG C 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
TTGTAAAACG ACGGCCAGTG AG 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GAAACAGCTA TGACCATGAT TACG 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CAGACTCTGC AGCAGGTCCA CAG 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GGACCTGCTG CAGAGTCTG 19 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GCCTGTGCTC AATATTGATG G 21 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CCGTGTTAAA GCAGAAGATA CTG 23 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GCTACTGTGA AAGAACTTGC CTC 23 
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<2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1263 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

GAGCTCTTGG TTCTGGTGAC TGTGGCCCTG GCATCTGCTC ATCATGGTGG TGAGCACTTT 60 

GAAGGCGAGA AGGTGTTCCG TGTTAACGTT GAAGATGAAA ATCACATTAA CATAATCCGC 120 

GAGTTGGCCA GCACGACCCA GATTGACTTC TGGAAGCCAG ATTCTGTCAC ACAAATCAAA 180 

CCTCACAGTA CAGTTGACTT CCGTGTTAAA GCAGAAGATA CTGTCACTGT GGAGAATGTT 240 

CTAAAGCAQA ATGAACTACA ATACAAGGTA CTGATAAGCA ACCTGAGAAA T6TGGTGGAG 300 

GCTCAGTTTG ATAGCCGGGT TCGTGCAACA GGACACAGTT ATGAGAAGTA CAACAAGTOG 360 

GAAACGATAG AGGCTTGGAC TCAACAAGTC GCCACTGAGA ATCCAGCCCT CATCTCTCGC 420 

AGTGTTATCG GAACCACATT TGAGGGACGC GCTATTTACC TCCTGAAGGT TGGCAAAGCT 480 

GGACAAAATA AGCCTGCCAT TTTCATGGAC TGTGGTTTCC ATGCCAGAGA GTGGATTTCT 540 

CCTGCATTCT GCCAGTGGTT TGTAAGAGAG GCTGTTCGTA CCTATGGACG TGAGATCCAA 600 

GT6ACAGAGC TTCTCGACAA GTTAGACTTT TATGTCCTGC CTGTGCTCAA TATTGATGGC 660 

TACATCTACA CCTGGACCAA GAGCCGATTT TGGAGAAAGA CTCGCTCCAC CCATACTGGA 720 

TCTAGCTGCA TTGGCACAGA CCCCAACAGA AATTTTGATG CTGGTTGGTG TGAAATTGGA 780 

GCCTCTCGAA ACCCCTGTGA TGAAACTTAC TGTGGACCTG CCGCAGAGTC TGAAAAGGAA 840 

ACCAAGGCCC TGGCTGATTT CATCCGCAAC AAACTCTCTT CCATCAAGGC ATATCTGACA 900 

ATCCACTCGT ACTCCCAAAT GATGATCTAC CCTTACTCAT ATGCTTACAA ACTCGGTGAG 960 

AACAATGCTG AGTTGAATGC CCTGGCTAAA GCTACTGTGA AAGAACTTGC CTCACTGCAC 1020 

GGCACCAAGT ACACATATGG CCCGGGAGCT ACAACAATCT ATCCTGCTGC TGGGGGCTCT 1080 

GACGACTGGG CTTATGACCA AGGAATCAGA TATTCCTTCA CCTTTGAACT TCGAGATACA 1140 

GGCAGATATG GCTTTCTCCT TCCAGAATCC CAGATCCGGG CTACCTGCGA GGAGACCTTC 1200 

CTGGCAATCA AGTATGTTGC CAGCTACGTC CTGGAACACC TGTACTAGTT GAGAAAGCTC 1260 

1263 
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(2) INFORMATION FOR SEQ ID NO: 39: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Glu Leu Leu Val Leu Val Thr Val Ala Leu Ala Ser Ala His His Gly 
15 10 15 

Gly Glu His Phe Glu Gly Glu Lys Val Phe Arg Val Asn Val Glu Asp 
20 25 30 

Glu Asn His lie Asn He He Arg Glu Leu Ala Ser Thr Thr Gin He 
35 40 45 

Asp Phe Trp Lys Pro Asp Ser Val Thr Gin He Lys Pro His Ser Thr 
50 55 60 

Val Asp Phe Arg Val Lys Ala Glu Asp Thr Val Thr Val Glu Asn Val 
65 70 75 80 

Leu Lys Gin Asn Glu Leu Gin Tyr Lys Val Leu He Ser Asn Leu Arg 
85 90 95 

Asn Val Val Glu Ala Gin Phe Asp Ser Arg Val Arg Ala Thr Gly His 
100 105 110 

Ser Tyr Glu Lys Tyr Asn Lys Trp Glu Thr He Glu Ala Trp Thr Gin 
115 120 125 

Gin Val Ala Thr Glu Asn Pro Ala Leu He Ser Arg Ser Val He Gly 
130 135 140 

Thr Thr Phe Glu Gly Arg Ala He Tyr Leu Leu Lys Val Gly Lys Ala 
145 150 155 160 

Gly Gin Asn Lys Pro Ala He Phe Met Asp Cys Gly Phe His Ala Arg 
165 170 175 

Glu Trp He Ser Pro Ala Phe Cys Gin Trp Phe Val Arg Glu Ala Val 
180 185 190 

Arg Thr Tyr Gly Arg Glu He Gin Val Thr Glu Leu Leu Asp Lys Leu 
195 200 205 

Asp Phe Tyr Val Leu Pro Val Leu Asn He Asp Gly Tyr He Tyr Thr 
210 215 220 

Trp Thr Lys Ser Arg Phe Trp Arg Lys Thr Arg Ser Thr His Thr Gly 
225 230 235 240 
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Ser Ser Cys lie 61y Thr Asp Pro Asn Arg Asn Phe Asp Ala Gly Trp 
245 250 255 

Cys Glu He Gly Ala Ser Arg Asn Pro Cys Asp Glu Thr Tyr Cys Gly 
260 265 270 

Pro Ala Ala Glu Ser Glu Lys Glu Thr Lys Ala Leu Ala Asp Phe He 
275 280 285 

Arg Asn Lys Leu Ser Ser He Lys Ala Tyr Leu Thr He His Ser Tyr 
290 295 300 

Ser Gin Met Met He Tyr Pro Tyr Ser Tyr Ala Tyr Lys Leu Gly Glu 
305 310 315 320 

Asn Asn Ala Glu Leu Asn Ala Leu Ala Lys Ala Thr Val Lys Glu Leu 
325 330 335 

Ala Ser Leu His Gly Thr Lys Tyr Thr Tyr Gly Pro Gly Ala Thr Thr 
340 345 350 

He Tyr Pro Ala Ala Gly Gly Ser Asp Asp Trp Ala Tyr Asp Gin Gly 
355 360 365 

He Arg Tyr Ser Phe Thr Phe Glu Leu Arg Asp Thr Gly Arg Tyr Gly 
370 375 380 

Phe Leu Leu Pro Glu Ser Gin He Arg Ala Thr Cys Glu Glu Thr Phe 
365 390 395 400 

Leu Ala He Lys Tyr Val Ala Ser Tyr Val Leu Glu His Leu Tyr 
405 410 415 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
GCCGGGTTTG CGCAACTGGT CACTCTTACG AGAAG 35 
(2) INFORMATION FOR SEQ ID NO: 41: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CCGGAATTCT TATTAGTTCA GGTCCTCCTC AGAGATCAGC TTCTGCTCCT CGAACTCATG 
GTGGTGATGG TGGTGGTACA GGTGTTCC 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
TTAGCGGATC CTGCCTGACG GT 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
GGCTGGATTC TCAGTGGCGA CTT 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
ACCTCTAGGG TCCCCAATTA 
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(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE OIARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
CAAGTCGCCA CTGAGAATCC AGC 23 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1053 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(ix) FEATURE: 

(A) NAME/KEY: CDS ' 

(B) LOCATION :1. .1047 

(ix) FEATURE: 

(A) NAME/KEY: inat_peptide 

(B) LOCATION: 67. .1047 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

ATG AAA TAC CTA TTG CCT ACG GCA GCC OCT GGA TTG TTA TTA CTC GCT 4B 
Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
-22 -20 -15 -10 

GCC CAA CCA GCC ATG GCG GCA ACT GGT CAC TCT TAC GAG AAG TAC AAC 96 
Ala Gin Pro Ala Met Ala Ala Thr Gly His Ser Tyr Glu Lys Tyr Asn 
-5 1 5 10 

AAG TGG GAA ACG ATA GAG GCT TGG ACT CAA CAA GTC GCC ACT GAG AAT 144 
Lys Trp Glu Thr He Glu Ala Trp Thr Gin Gin Val Ala Thr Glu Asn 
15 20 25 

CCA GCC CTC ATC TCT CGC AGT GTT ATC GGA ACC ACA TTT GAG GGA CGC 192 
Pro Ala Leu He Ser Arg Ser Val He Gly Thr Thr Phe Glu Gly Arg 
30 35 40 

GCT ATT TAC CTC CTG AAG GTT GGC AAA GCT GGA CAA AAT AAG CCT GCC 240 
Ala He Tyr Leu Leu Lys Val Gly Lys Ala Gly Gin Asn Lys Pro Ala 
45 50 55 
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ATT TTC ATG GAC TGT GGT TTC CAT GCC AGA GAG TGG ATT TCT CCT GCA 288 
lie Phe Met Asp Cys Gly Phe His Ala Arg Glu Trp He Ser Pro Ala 
60 65 70 

TTC TGC CAG TGG TTT 6TA AGA GAG GCT GTT CGT ACC TAT GGA CGT GAG 336 
Phe Cys Gin Trp Phe Val Arg Glu Ala Val Arg Thr Tyr Gly Arg Glu 
75 80 85 90 

ATC CAA GTG ACA GAG CTT CTC GAC AAG TTA GAC TTT TAT GTC CTG CCT 384 
He Gin Val Thr Glu Leu Leu Asp Lys Leu Asp Phe Tyr Val Leu Pro 
95 100 105 

GTG CTC AAT ATT GAT GGC TAC ATC TAC ACC TGG ACC AAG AGC CGA TTT 432 
Val Leu Asn He Asp Gly Tyr He Tyr Thr Trp Thr Lys Ser Arg Phe 
110 115 120 

TGG AGA AAG ACT CGC TCC ACC CAT ACT GGA TCT AGC TGC ATT GGC ACA 480 
Trp Arg Lys Thr Arg Ser Thr His Thr Gly Ser Ser Cys He Gly Thr 
125 130 135 

GAC CCC AAC AGA AAT TTT GAT GCT GGT TGG TGT GAA ATT GGA GCC TCT 528 
Asp Pro Asn Arg Asn Phe Asp Ala Gly Trp Cys Glu He Gly Ala Ser 
140 145 150 

CGA AAC CCC TGT GAT GAA ACT TAC TGT GGA CCT GCC GCA GAG TCT GAA 576 
Arg Asn Pro Cys Asp Glu Thr Tyr Cys Gly Pro Ala Ala Glu Ser Glu 
155 160 165 170 

AAG GAG ACC AAG GCC CTG GCT GAT TTC ATC CGC AAC AAA CTC TCT TCC 624 
Lys Glu Thr Lys Ala Leu Ala Asp Phe He Arg Asn Lys Leu Ser Ser 
175 180 185 

ATC AAG GCA TAT CTG ACA ATC CAC TCG TAC TCC CAA ATG ATG ATC TAC 672 
He Lys Ala Tyr Leu Thr He His Ser Tyr Ser Gin Met Met He Tyr 
190 195 200 

CCT TAC TCA TAT GCT TAC AAA CTC GGT GAG AAC AAT GCT GAG TTG AAT 720 
Pro Tyr Ser Tyr Ala Tyr Lys Leu Gly Glu Asn Asn Ala Glu Leu Asn 
205 210 215 

GCC CTG GCT AAA GCT ACT GTG AAA GAA CTT GCC TCA CTG CAC GGC ACC 768 
Ala Leu Ala Lys Ala Thr Val Lys Glu Leu Ala Ser Leu His Gly Thr 
220 225 230 

AAG TAC ACA TAT GGC CCG GGA GCT ACA ACA ATC TAT CCT GCT GCT GGG 816 
Lys Tyr Thr Tyr Gly Pro Gly Ala Thr Thr He Tyr Pro Ala Ala Gly 
235 240 245 250 

GGC TCT GAC GAC TGG GCT TAT GAC CAA GGA ATC AGA TAT TCC TTC ACC 864 
Gly Ser Asp Asp Trp Ala Tyr Asp Gin Gly He Arg Tyr Ser Phe Thr 
255 260 265 

TTT GAA CTT CGA GAT ACA GGC AGA TAT GGC TTT CTC CTT CCA GAA TCC 912 
Phe Glu Leu Arg Asp Thr Gly Arg Tyr Gly Phe Leu Leu Pro Glu Ser 
270 275 280 



CAG ATC CGG GCT ACC TGC GAG GAG ACC TTC CTG GCA ATC AAG TAT GTT 960 
Gin He Arg Ala Thr Cys Glu Glu Thr Phe Leu Ala He Lys Tyr Val 
285 290 295 
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GCC AGC TAG GTC CTG GAA CAC CTG TAG CAC GAG GAT GAG GAG GAT GAG 1008 
Ala Ser Tyr Val Leu Glu His Leu Tyr His His His His His His Glu 
300 305 310 

TTG GAG GAG GAG AAG GTG ATC TGT GAG GAG GAG GTG AAG TAATAA 1053 
Phe Glu Glu Gin Lys Leu lie Ser Glu Glu Asp Leu Asn 
315 320 325 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENGE CHARAGTER ISTI GS : 

(A) LENGTH: 349 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLEGULE TYPE: protein 

(xi) SEQUENGE DESGRIPTION: SEQ ID NO: 47: 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
-22 -20 -15 -10 

Ala Gin Pro Ala Met Ala Ala Thr Gly His Ser Tyr Glu Lys Tyr Asn 
-5 1 5 10 

Lys Trp Glu Thr He Glu Ala Trp Thr Gin Gin Val Ala Thr Glu Asn 
15 20 25 

Pro Ala Leu He Ser Arg Ser Val He Gly Thr Thr Phe Glu Gly Arg 
30 35 40 

Ala He Tyr Leu Leu Lys Val Gly Lys Ala Gly Gin Asn Lys Pro Ala 
45 50 55 

He Phe Met Asp Gys Gly Phe His Ala Arg Glu Trp He Ser Pro Ala 
60 65 70 

Phe Gys Gin Trp Phe Val Arg Glu Ala Val Arg Thr Tyr Gly Arg Glu 
75 80 85 90 

He Gin Val Thr Glu Leu Leu Asp Lys Leu Asp Phe Tyr Val Leu Pro 
95 100 105 

Val Leu Asn He Asp Gly Tyr He Tyr Thr Trp Thr Lys Ser Arg Phe 
110 115 120 

Trp Arg Lys Thr Arg Ser Thr His Thr Gly Ser Ser Gys He Gly Thr 
125 130 135 

Asp Pro Asn Arg Asn Phe Asp Ala Gly Trp Gys Glu He Gly Ala Ser 
140 145 150 

Arg Asn Pro Gys Asp Glu Thr Tyr Gys Gly Pro Ala Ala Glu Ser Glu 
155 160 165 170 

Lys Glu Thr Lys Ala Leu Ala Asp Phe He Arg Asn Lys Leu Ser Ser 
175 180 185 

He Lys Ala Tyr Leu Thr He His Ser Tyr Ser Gin Met Met He Tyr 
190 195 200 
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Pro Tyr Ser Tyr Ala Tyr Lys Leu Gly Glu Asn Asn Ala Glu Leu Asn 
205 210 215 

Ala Leu Ala Lys Ala Thr Val Lys Glu Leu Ala Ser Leu His Gly Thr 
220 225 230 

Lys Tyr Thr Tyr Gly Pro Gly Ala Thr Thr lie Tyr Pro Ala Ala Gly 
235 240 245 250 

Gly Ser Asp Asp Trp Ala Tyr Asp Gin Gly lie Arg Tyr Ser Phe Thr 
255 260 265 

Phe Glu Leu Arg Asp Thr Gly Arg Tyr Gly Phe Leu Leu Pro Glu Ser 
270 275 280 

Gin lie Arg Ala Thr Cy^ "^V^ Val 

285 290 295 

Ala Ser Tyr Val Leu Glu His Leu Tyr His His His His His His Glu 
300 305 310 

Phe Glu Glu Gin Lys Leu lie Ser Glu Glu Asp Leu Asn 
315 320 325 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

GTTATTACTC GCTGCCCAAC CAGCCATGGC G 31 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 41 base pairs 
(6) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
CTCTAGGAAT TCTTATTAGT ACAGGTGTTC C^VGGACGTAG C 41 
(2) INFORMATION FOR SEQ ID NO: 50: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .987 



(ix) FEATURE: 

(A) NAME/KEY: inat_peptide 

(B) LOCATION: 67. .987 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

ATG AAA TAC CTA TTG CCT ACG OCA GCC GCT GGA TTG TTA TTA CTC GCT 48 
Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
-22 -20 -15 -10 

GCC CAA CCA GCC ATO GCG GCA ACT GGT CAC TCT TAC GAG AAG TAC AAC 96 
Ala Gin Pro Ala Met Ala Ala Thr Gly His Ser Tyr Glu Lys Tyr Asn 
-5 1 5 10 

AAG TGG GAA ACG ATA GAG GCT TGG ACT CAA CAA GTC GCC ACT GAG AAT 144 
Lys Trp Glu Thr He Glu Ala Trp Thr Gin Gin Val Ala Thr Glu Asn 
15 20 25 

CCA GCC CTC ATC TCT CGC AGT GTT ATC GGA ACC ACA TTT GAG GGA CGC 192 
Pro Ala Leu He Ser Arg Ser Val He Gly Thr Thr Phe Glu Gly Arg 
30 35 40 

GCT ATT TAC CTC CTG AAG GTT GGC AAA GCT GGA CAA AAT AAG CCT GCC 240 
Ala He Tyr Leu Leu Lys Val Gly Lys Ala Gly Gin Asn Lys Pro Ala 
45 50 55 

ATT TTC ATG GAC TGT GGT TTC CAT GCC AGA GAG TGG ATT TCT CCT GCA 288 
He Phe Met Asp Cys Gly Phe His Ala Arg Glu Trp He Ser Pro Ala 
60 65 70 

TTC TGC CAG TGG TTT GTA AGA GAG GCT GTT CGT ACC TAT GGA CGT GAG 336 
Phe Cys Gin Trp Phe Val Arg Glu Ala Val Arg Thr Tyr Gly Arg Glu 
75 80 85 90 

ATC CAA GTG ACA GAG CTT CTC GAC AAG TTA GAC TTT TAT GTC CTG CCT 384 
He Gin Val Thr Glu Leu Leu Asp Lys Leu Asp Phe Tyr Val Leu Pro 
95 100 105 

GTG CTC AAT ATT GAT GGC TAC ATC TAC ACC TGG ACC AAG AGC CGA TTT 432 
Val Leu Asn He Asp Gly Tyr He Tyr Thr Trp Thr Lys Ser Arg Phe 
110 115 120 

TGG AGA AAG ACT CGC TCC ACC CAT ACT GGA TCT AGC TGC ATT GGC ACA 480 
Trp Arg Lys Thr Arg Ser Thr His Thr Gly Ser Ser Cys He Gly Thr 
125 130 135 



SUBSTITUTE SHEET (RULE 26) 



wo 97/07769 



-149- 



PCT/GB96/01975 



GAC CCC AAC AGA AAT TTT GAT GCT GGT TGG TGT GAA ATT GGA GCC TCT 528 
Asp Pro Asn Arg Asn Phe Asp Ala Gly Trp Cys Glu He Gly Ala Ser 
140 145 150 

CGA AAC CCC TGT GAT GAA ACT TAC TGT GGA CCT GCC GCA GAG TCT GAA 576 
Arg Asn Pro Cys Asp Glu Thr Tyr Cys Gly Pro Ala Ala Glu Ser Glu 
155 160 165 170 

AAG GAG ACC AAG GCC CTG GCT GAT TTC ATC CGC AAC AAA CTC TCT TCC 624 
Lys Glu Thr Lys Ala Leu Ala Asp Phe He Arg Asn Lys Leu Ser Ser 
175 180 185 

ATC AAG GCA TAT CTG ACA ATC CAC TCG TAC TCC CAA ATG ATG ATC TAC 672 
He Lys Ala Tyr Leu Thr He His Ser Tyr Ser Gin Met Met He Tyr 
150 195 200 



CCT TAC TCA TAT GCT TAC AAA CTC GGT GAG AAC AAT GCT GAG TTG AAT 720 
Pro Tyr Ser Tyr Ala Tyr Lys Leu Gly Glu Asn Asn Ala Glu Leu Asn 
205 210 215 

GCC CTG GCT AAA OCT ACT GTG AAA GAA CTT GCC TCA CTG CAC GGC ACC 768 
Ala Leu Ala Lys Ala Thr Val Lys Glu Leu Ala Ser Leu His Gly Thr 
220 225 230 

AAG TAC ACA TAT GGC CCG GGA GCT ACA ACA ATC TAT CCT GCT GCT GGG 816 
Lys Tyr Thr Tyr Gly Pro Gly Ala Thr Thr He Tyr Pro Ala Ala Gly 
235 240 245 250 

GGC TCT GAC GAC TGG GCT TAT GAC CAA GGA ATC AGA TAT TCC TTC ACC 864 
Gly Ser Asp Asp Trp Ala Tyr Asp Gin Gly He Arg Tyr Ser Phe Thr 
255 260 265 

TTT GAA CTT CGA GAT ACA GGC AGA TAT GGC TTT CTC CTT CCA GAA TCC 912 
Phe Glu Leu Arg Asp Thr Gly Arg Tyr Gly Phe Leu Leu Pro Glu Ser 
270 275 280 

CAG ATC CGG GCT ACC TGC GAG GAG ACC TTC CTG GCA ATC AAG TAT GTT 960 
Gin He Arg Ala Thr Cys Glu Glu Thr Phe Leu Ala He Lys Tyr Val 
285 290 295 

GCC AGC TAC GTC CTG GAA CAC CTG TAC TAATAAGAAT TC 999 
Ala Ser Tyr Val Leu Glu His Leu Tyr 
300 305 



(2) INFORMATION FOR SEQ ID NO: 51 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
-22 -20 "15 -10 
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Ala Gin Pro Ala Met Ala Ala Thr Oly His Ser Tyr Glu Lys Tyr Asn 
'5 1 5 10 

Lys Trp Glu Thr He Glu Ala Trp Thr Gin Gin Val Ala Thr Glu Asn 
15 20 25 

Pro Ala Leu He Ser Arg Ser Val He Gly Thr Thr Phe Glu Gly Arg 
30 35 40 

Ala He Tyr Leu Leu Lys Val Gly Lys Ala Gly Gin Asn Lys Pro Ala 
45 50 55 

He Phe Met Asp Cys Gly Phe His Ala Arg Glu Trp He Ser Pro Ala 
€0 65 70 

Phe Cys Gin Trp Phe Val Arg Glu Ala Val Arg Thr Tyr Gly Arg Glu 
75 80 85 90 

He Gin Val Thr Glu Leu Leu Asp Lys Leu Asp Phe Tyr Val Leu Pro 
55 100 105 



Val Leu Asn He Asp Gly Tyr He Tyr Thr Trp Thr Lys Ser Arg Phe 
110 115 3^20 

Trp Arg Lys Thr Arg Ser Thr His Thr Gly Ser Ser Cys He Gly Thr 
125 130 135 

Asp Pro Asn Arg Asn Phe Asp Ala Gly Trp Cys Glu He Gly Ala Ser 
140 145 

Arg Asn Pro Cys Asp Glu Thr Tyr Cys Gly Pro Ala Ala Glu Ser Glu 
155 160 165 170 

Lys Glu Thr Lys Ala Leu Ala Asp Phe He Arg Asn Lys Leu Ser Ser 
175 180 185 

He Lys Ala Tyr Leu Thr He His Ser Tyr Ser Gin Met Met He Tyr 
ISO 195 200 

Pro Tyr Ser Tyr Ala Tyr Lys Leu Gly Glu Asn Asn Ala Glu Leu Asn 
205 210 215 

Ala Leu Ala Lys Ala Thr Val Lys Glu Leu Ala Ser Leu His Gly Thr 
220 225 230 

Lys Tyr Thr Tyr Gly Pro Gly Ala Thr Thr He Tyr Pro Ala Ala Gly 
235 240 245 250 

Gly Ser Asp Asp Trp Ala Tyr Asp Gin Gly He Arg Tyr Ser Phe Thr 
255 260 265 

Phe Glu Leu Arg Asp Thr Gly Arg Tyr Gly Phe Leu Leu Pro Glu Ser 
270 275 280 

Gin He Arg Ala Thr Cys Glu Glu Thr Phe Leu Ala He Lys Tyr Val 
285 290 295 



Ala Ser Tyr Val Leu Glu His Leu Tyr 
300 305 
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(2) INFORMATION FOR SEQ ID NO: 52: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
CCAACCAGCC ATGGOGCATC ATGGTGGTGA GCAC 34 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GGCTGGATTC TCAGTGGCGA CTT 23 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
GGAGAAAGCC ATATCTGCCT G 21 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 B4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 



(ix} FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION:!. .1272 

(ix) FEATURE: 

(A) NAME/KEY: matjeptide 

(B) LOCATION: 352. .1272 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

ATG AAA TAC CTA TTG CCT ACG GCA GCC OCT GGA TTG TTA TTA CTC GCT 48 
Met LyB Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
-117 -115 -110 -105 

GCC CAA CCA GCC ATG GCG CAT CAT GGT GGT GAG CAC TTT GAA GGC GAG 96 
Ala Gin Pro Ala Met Ala His His Gly Gly Glu His Phe Glu Gly Glu 
-100 -95 -90 

AAG GTG TTC CGT GTT AAC GTT GAA GAT GAA AAT CAC ATT AAC ATA ATC 144 
Lys Val Phe Arg Val Asn Val Glu Asp Glu Asn His He Asn He He 
-B5 -80 -75 -70 

CGC GAG TTG GCC AGC ACG ACC CAG ATT GAC TTC TGG AAG CCA GAT TCT 192 
Arg Glu Leu Ala Ser Thr Thr Gin He Asp Phe Trp Lys Pro Asp Ser 
-65 -60 -55 

GTC ACA CAA ATC AAA CCT CAC AGT ACA GTT GAC TTC CGT GTT AAA GCA 240 
Val Thr Gin He Lys Pro His Ser Thr Val Asp Phe Arg Val Lys Ala 
-50 -45 -40 

GAA GAT ACT GTC ACT GTG GAG AAT GTT CTA AAG CAG AAT GAA CTA CAA 288 
Glu Asp Thr Val Thr Val Glu Asn Val Leu Lys Gin Asn Glu Leu Gin 
-35 -30 -25 

TAC AAG GTA CTG ATA AGC AAC CTG AGA AAT GTG GTG GAG GCT CAG TTT 336 
Tyr Lys Val Leu He Ser Asn Leu Arg Asn Val Val Glu Ala Gin Phe 
-20 -15 -10 

GAT AGC CGG GTT CGT GCA ACA GGA CAC AGT TAT GAG AAG TAC AAC AAG 384 
Asp Ser Arg Val Arg Ala Thr Gly His Ser Tyr Glu Lys Tyr Asn Lys 
-5 1 5 10 

TGG GAA ACG ATA GAG GCT TGG ACT CAA CAA GTC GCC ACT GAG AAT CCA 432 
Trp Glu Thr He Glu Ala Trp Thr Gin Gin Val Ala Thr Glu Asn Pro 
15 20 25 

GCC CTC ATC TCT CGC AGT GTT ATC GGA ACC ACA TTT GAG GGA CGC GCT 480 
Ala Leu He Ser Arg Ser Val He Gly Thr Thr Phe Glu Gly Arg Ala 
30 35 40 

ATT TAC CTC CTG AAG GTT GGC AAA GCT GGA CAA AAT AAG CCT GCC ATT 528 
He Tyr Leu Leu Lys Val Gly Lys Ala Gly Gin Asn Lys Pro Ala He 
45 50 55 
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TTC ATG GAC TGT GOT TTC CAT GCC AGA GAG TGG ATT TOT CCT GCA TTC 576 
Phe Met Asp Cys Gly Phe His Ala Arg Glu Trp lie Ser Pro Ala Phe 
60 65 70 75 

TGC GAG TGG TTT GTA AGA GAG GOT GTT C6T ACC TAT GGA CGT GAG ATC 624 
Cys Gin Trp Phe Val Arg Glu Ala Val Arg Thr Tyr Gly Arg Glu He 
80 85 90 



CAA GTG ACA GAG CTT CTC GAC AAG TTA GAC TTT TAT GTC CTG CCT GTG 672 
Gin Val Thr Glu Leu Leu Asp Lys Leu Asp Phe Tyr Val Leu Pro Val 
95 100 105 

CTC AAT ATT GAT GGC TAC ATC TAC ACC TGG ACC AAG AGC CGA TTT TGG 720 
Leu Asn He Asp Gly Tyr He Tyr Thr Trp Thr Lys Ser Arg Phe Trp 
110 115 120 

AGA AAG ACT CGC TCC ACC CAT ACT GGA TCT AGC TGC ATT GGC ACA GAC 768 
Arg Lys Thr Arg Ser Thr His Thr Gly Ser Ser Cys He Gly Thr Asp 
125 130 135 

CCC AAC AGA AAT TTT GAT GCT GGT TGG TGT GAA ATT GGA GCC TCT CGA 816 
Pro Asn Arg Asn Phe Asp Ala Gly Trp Cys Glu He Gly Ala Ser Arg 
140 145 150 155 

AAC CCC TGT GAT GAA ACT TAC TGT GGA CCT GCC GCA GAG TCT GAA AAG 864 
Asn Pro Cys Asp Glu Thr Tyr Cys Gly Pro Ala Ala Glu Ser Glu Lys 
160 165 170 

GAG ACC AAG GCC CTG GCT GAT TTC ATC CGC AAC AAA CTC TCT TCC ATC 912 
Glu Thr Lys Ala Leu Ala Asp Phe He Arg Asn Lys Leu Ser Ser He 
175 180 185 

AAG GCA TAT CTG ACA ATC CAC TCG TAC TCC CAA ATG ATG ATC TAC CCT 960 
Lys Ala Tyr Leu Thr He His Ser Tyr Ser Gin Met Met He Tyr Pro 
190 195 200 

TAC TCA TAT GCT TAC AAA CTC GGT GAG AAC AAT GCT GAG TTG AAT GCC 1008 
Tyr Ser Tyr Ala Tyr Lys Leu Gly Glu Asn Asn Ala Glu Leu Asn Ala 
205 210 215 

CTG GCT AAA GCT ACT GTG AAA GAA CTT GCC TCA CTG CAC GGC ACC AAG 1056 
Leu Ala Lys Ala Thr Val Lys Glu Leu Ala Ser Leu His Gly Thr Lys 
220 225 230 235 

TAC ACA TAT GGC CCG GGA GCT ACA ACA ATC TAT CCT GCT GCT GGG GGC 1104 
Tyr Thr Tyr Gly Pro Gly Ala Thr Thr He Tyr Pro Ala Ala Gly Gly 
240 245 250 

TCT GAC GAC TGG GCT TAT GAC CAA GGA ATC AGA TAT TCC TTC ACC TTT 1152 
Ser Asp Asp Trp Ala Tyr Asp Gin Gly He Arg Tyr Ser Phe Thr Phe 
255 260 265 

GAA CTT CGA GAT ACA GGC AGA TAT GGC TTT CTC CTT CCA GAA TCC CAG 1200 
Glu Leu Arg Asp Thr Gly Arg Tyr Gly Phe Leu Leu Pro Glu Ser Gin 
270 275 280 

ATC CGG GCT ACC TGC GAG GAG ACC TTC CTG GCA ATC AAG TAT GTT GCC 1248 
He Arg Ala Thr Cys Glu Glu Thr Phe Leu Ala He Lys Tyr Val Ala 
285 290 295 
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AGC TAC GTC CTG 
Ser Tyr Val Leu 
300 



GAA CAC CTG TAC TAATAAGAAT TC 
Glu His Leu Tyr 
305 



1284 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 424 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
-117 -115 -110 -105 

Ala Gin Pro Ala Met Ala His His Gly Gly Glu His Phe Glu Gly Glu 
-100 -95 -90 

Lys Val Phe Arg Val Asn Val Glu Asp Glu Asn His lie Asn lie lie 
-85 -80 -75 -70 

Arg Glu Leu Ala Ser Thr Thr Gin lie Asp Phe Trp Lys Pro Asp Ser 
-65 -60 -55 

Val Thr Gin He Lys Pro His Ser Thr Val Asp Phe Arg Val Lys Ala 
-50 -45 -40 

Glu Asp Thr Val Thr Val Glu Asn Val Leu Lys Gin Asn Glu Leu Gin 
-35 -30 -25 

Tyr Lys Val Leu He Ser Asn Leu Arg Asn Val Val Glu Ala Gin Phe 
-20 -15 -10 

Asp Ser Arg Val Arg Ala Thr Gly His Ser Tyr Glu Lys Tyr Asn Lys 
-5 15 10 



Trp Glu Thr He Glu Ala Trp Thr Gin Gin Val Ala Thr Glu Asn Pro 
15 20 25 

Ala Leu He Ser Arg Ser Val He Gly Thr Thr Phe Glu Gly Arg Ala 
30 35 40 

He Tyr Leu Leu Lys Val Gly Lys Ala Gly Gin Asn Lys Pro Ala He 
45 50 55 

Phe Met Asp Cys Gly Phe His Ala Arg Glu Trp He Ser Pro Ala Phe 
60 65 70 75 

Cys Gin Trp Phe Val Arg Glu Ala Val Arg Thr Tyr Gly Arg Glu He 
80 85 90 

Gin Val Thr Glu Leu Leu Asp Lys Leu Asp Phe Tyr Val Leu Pro Val 
95 100 105 

Leu Asn He Asp Gly Tyr He Tyr Thr Trp Thr Lys Ser Arg Phe Trp 



110 



115 



120 
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Arg Lys Thr Arg Ser Thr His Thr Gly Ser Ser Cys He Gly Thr Asp 
125 130 

Pro Asn Arg Asn Phe Asp Ala Gly Trp Cys Glu He Gly Ala Ser Arg 
145 150 155 

Asn Pro Cys Asp Glu Thr Tyr Cys Gly Pro Ala Ala Glu Ser Glu Lys 

165 170 

Glu Thr Lys Ala Leu Ala Asp Phe He Arg Asn Lys Leu Ser Ser He 
175 180 185 

Lys Ala Tyr Leu Thr He His Ser Tyr Ser Gin Met Met He Tyr Pro 
150 195 200 

Tyr Ser Tyr Ala Tyr Lys Leu Gly Glu Asn Asn Ala Glu Leu Asn Ala 
205 210 215 

Leu Ala Lys Ala Thr Val Lys Glu Leu Ala Ser Leu His Gly Thr Lvs 
220 225 230 235 

Tyr Thr Tyr Gly Pro Gly Ala Thr Thr He Tyr Pro Ala Ala Gly Gly 
240 245 250 

Ser Asp Asp Trp Ala Tyr Asp Gin Gly He Arg Tyr Ser Phe Thr Phe 
255 260 265 

Glu Leu Arg Asp Thr Gly Arg Tyr Gly Phe Leu Leu Pro Glu Ser Gin 
270 275 280 

He Arg Ala Thr Cys Glu Glu Thr Phe Leu Ala He Lys Tyr Val Ala 
285 290 295 

Ser Tyr Val Leu Glu His Leu Tyr 
300 305 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
GGTCATAAGC CCAGTCTTTA GAGCC 
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
CCTGCTGCTG GGGGCTCTAA AGACTGG 27 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .1047 

(ix) FEATURE: 

(A) NAME/KEY: inatj)eptide 

(B) LOCATION: 67. .1047 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

ATG AAA TAC CTA TTG CCT ACG GCA GCC GCT G<3A TTG TTA TTA CTC GCT 48 
Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
-22 -20 -15 -10 

GCC CAA CCA GCC ATG GCG GCA ACT GGT CAC TCT TAC GAG AAG TAC AAC 96 
Ala Gin Pro Ala Met Ala Ala Thr Gly His Ser Tyr Glu Lys Tyr Asn 
-5 1 5 10 

AAG TGG GAA ACG ATA GAG GCT TGG ACT CAA CAA GTC GCC ACT GAG AAT 144 
Lys Trp Glu Thr He Glu Ala Trp Thr Gin Gin Val Ala Thr Glu Asn 
15 20 25 

CCA GCC CTC ATC TCT CGC AGT GTT ATC GQA ACC ACA TTT GAG GGA CGC 192 
Pro Ala Leu He Ser Arg Ser Val He Gly Thr Thr Phe Glu Gly Arg 
30 35 40 

GCT ATT TAC CTC CTG AAG GTT GGC AAA GCT GGA CAA AAT AAG CCT GCC 240 
Ala He Tyr Leu Leu Lys Val Gly Lys Ala Gly Gin Asn Lys Pro Ala 
45 50 55 

ATT TTC ATG GAC TGT GGT TTC CAT GCC AGA GAG TGG ATT TCT CCT GCA 288 
He Phe Met Asp Cys Gly Phe His Ala Arg Glu Trp He Ser Pro Ala 
60 65 70 

TTC TGC CAG TGG TTT GTA AGA GAG GCT GTT CGT ACC TAT GGA CGT GAG 336 
Phe Cys Gin Trp Phe Val Arg Glu Ala Val Arg Thr Tyr Gly Arg Glu 
75 80 85 90 

ATC CAA GTG ACA GAG CTT CTC GAC AAG TTA GAC TTT TAT GTC CTG CCT 384 
He Gin Val Thr Glu Leu Leu Asp Lys Leu Asp Phe Tyr Val Leu Pro 
95 100 105 

GTG CTC AAT ATT GAT GGC TAC ATC TAC ACC TGG ACC AAG AGC CGA TTT 432 
Val Leu Asn He Asp Gly Tyr He Tyr Thr Trp Thr Lys Ser Arg Phe 
110 115 120 
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TGG AGA AAG ACT CGC TCC ACC CAT ACT GGA TCT AGC TGC ATT GGC ACA 480 
Trp Arg Lys Thr Arg Ser Thr His Thr Gly Ser Ser Cys lie Gly Thr 
125 130 135 

GAC CCC AAC AGA AAT TTT GAT GCT GGT TGG TGT GAA ATT GGA GCC TCT 528 
Asp Pro Asn Arg Asn Phe Asp Ala Gly Trp Cys Glu He Gly Ala Ser 
140 145 150 

GGA AAC CCC TGT GAT GAA ACT TAC TGT GGA CCT GCC GCA GAG TCT GAA 576 
Arg Asn Pro Cys Asp Glu Thr Tyr Cys Gly Pro Ala Ala Glu Ser Glu 
155 160 165 170 



AAG GAG ACC AAG GCC CTG GCT GAT TTC ATC CGC AAC AAA CTC TCT TCC 624 
Lys Glu Thr Lys Ala Leu Ala Asp Phe He Arg Asn Lys Leu Ser Ser 
175 180 185 

ATC AAG GCA TAT CTG ACA ATC CAC TCG TAC TCC CAA ATG ATG ATC TAC 672 
He Lys Ala Tyr Leu Thr He His Ser Tyr Ser Gin Met Met He Tyr 
190 195 200 

CCT TAC TCA TAT GCT TAC AAA CTC GGT GAG AAC AAT GCT GAG TTG AAT 720 
Pro Tyr Ser Tyr Ala Tyr Lys Leu Gly Glu Asn Asn Ala Glu Leu Asn 
205 210 215 

GCC CTG GCT AAA GCT ACT GTG AAA GAA CTT GCC TCA CTG CAC GGC ACC 768 
Ala Leu Ala Lys Ala Thr Val Lys Glu Leu Ala Ser Leu His Gly Thr 
220 225 230 

AAG TAC ACA TAT GGC CCG GGA GCT ACA ACA ATC TAT CCT GCT GCT GGG 816 
Lys Tyr Thr Tyr Gly Pro Gly Ala Thr Thr He Tyr Pro Ala Ala Gly 
235 240 245 250 

GGC TCT AAA GAC TGG GCT TAT GAC CAA GGA ATC AGA TAT TCC TTC ACC 864 
Gly Ser Lys Asp Trp Ala Tyr Asp Gin Gly He Arg Tyr Ser Phe Thr 
255 260 265 

TTT GAA CTT CGA GAT ACA GGC AGA TAT GGC TTT CTC CTT CCA GAA TCC 912 
Phe Glu Leu Arg Asp Thr Gly Arg Tyr Gly Phe Leu Leu Pro Glu Ser 
270 275 280 

CAG ATC CGG GCT ACC TGC GAG GAG ACC TTC CTG GCA ATC AAG TAT GTT 960 
Gin He Arg Ala Thr Cys Glu Glu Thr Phe Leu Ala He Lys Tyr Val 
285 290 295 

GCC AGC TAC GTC CTG GAA CAC CTG TAC CAC CAC CAT CAC CAC CAT GAG 1008 
Ala Ser Tyr Val Leu Glu His Leu Tyr His His His His His His Glu 
300 305 310 

TTC GAG GAG CAG AAG CTG ATC TCT GAG GAG GAC CTG AAC TAATAAGAAT 1057 
Phe Glu Glu Gin Lys Leu He Ser Glu Glu Asp Leu Asn 
315 320 325 

TC 1059 
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(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
-22 -20 -15 -10 

Ala Gin Pro Ala Met Ala Ala Thr Gly His Ser Tyr Glu Lys Tyr Asn 
-5 15 10 

Lys Trp Glu Thr He Glu Ala Trp Thr Gin Gin Val Ala Thr Glu Asn 
15 20 25 

Pro Ala Leu He Ser Arg Ser Val He Gly Thr Thr Phe Glu Gly Arg 

30 35 40 

Ala He Tyr Leu Leu Lys Val Gly Lys Ala Gly Gin Asn Lys Pro Ala 
45 50 55 

He Phe Met Asp Cys Gly Phe His Ala Arg Glu Trp He Ser Pro Ala 
60 65 70 

Phe Cys Gin Trp Phe Val Arg Glu Ala Val Arg Thr Tyr Gly Arg Glu 
75 80 85 90 

He Gin Val Thr Glu Leu Leu Asp Lys Leu Asp Phe Tyr Val Leu Pro 
95 100 105 

Val Leu Asn He Asp Gly Tyr He Tyr Thr Trp Thr Lys Ser Arg Phe 
110 115 120 

Trp Arg Lys Thr Arg Ser Thr His Thr Gly Ser Ser Cys He Gly Thr 
125 130 135 

Asp Pro Asn Arg Asn Phe Asp Ala Gly Trp Cys Glu He Gly Ala Ser 
140 145 150 

Arg Asn Pro Cys Asp Glu Thr Tyr Cys Gly Pro Ala Ala Glu Ser Glu 
155 160 165 170 

Lys Glu Thr Lys Ala Leu Ala Asp Phe He Arg Asn Lys Leu Ser Ser 
175 180 185 

He Lys Ala Tyr Leu Thr He His Ser Tyr Ser Gin Met Met He Tyr 
190 195 200 

Pro Tyr Ser Tyr Ala Tyr Lys Leu Gly Glu Asn Asn Ala Glu Leu Asn 
205 210 215 

Ala Leu Ala Lys Ala Thr Val Lys Glu Leu Ala Ser Leu His Gly Thr 
220 225 230 

Lys Tyr Thr Tyr Gly Pro Gly Ala Thr Thr He Tyr Pro Ala Ala Gly 
235 240 245 250 

Gly Ser Lys Asp Trp Ala Tyr Asp Gin Gly He Arg Tyr Ser Phe Thr 
255 260 265 
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Phe Glu Leu Arg Asp Thr Gly Arg Tyr Gly Phe Leu Leu Pro Glu Ser 
270 275 280 

Gin He Arg Ala Thr Cys Glu Glu Thr Phe Leu Ala He Lys Tyr Val 
285 290 295 

Ala Ser Tyr Val Leu Glu His Leu Tyr His His His His His His Glu 
300 305 310 

Phe Glu Glu Gin Lys Leu He Ser Glu Glu Asp Leu Asn 
315 320 325 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

GGTCATAAGC CCAGTC6CGA GAGCC 25 

(2) INFORMATION FOR SEQ ID NO: 62: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
CCTGCTGCTG GGGGCTCTCG CGACTGG 27 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION:!. .1047 
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(ix) FEATURE: 

(A) NAME/KEY: matjeptide 

(B) LOCATION: 67. ,1047 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

ATG AAA TAC CTA TTG CCT ACG GCA GCC OCT GGA TTG TTA TTA CTC GCT 48 
Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
-22 -20 -15 -10 

GCC CAA CCA GCC ATG GCG GCA ACT GGT CAC TCT TAC GAG AAG TAC AAC 96 
Ala Gin Pro Ala Met Ala Ala Thr Gly His Ser Tyr Glu Lys Tyr Asn 
-5 1 5 10 

AAG TGG GAA ACG ATA GAG GCT TGG ACT CAA CAA GTC GCC ACT GAG AAT 144 
Lys Trp Glu Thr lie Glu Ala Trp Thr Gin Gin Val Ala Thr Glu Asn 
15 20 25 

CCA GCC CTC ATC TCT CGC AGT GTT ATC GGA ACC ACA TTT GAG GGA CGC 192 
Pro Ala Leu He Ser Arg Ser Val He Gly Thr Thr Phe Glu Gly Arg 

30 35 40 

GCT ATT TAC CTC CTG AAG GTT GGC AAA GCT GGA CAA AAT AAG CCT GCC 240 
Ala He Tyr Leu Leu Lys Val Gly Lys Ala Gly Gin Asn Lys Pro Ala 
45 50 55 

ATT TTC ATG 6AC TGT GGT TTC CAT GCC AGA GAG TGG ATT TCT CCT GCA 288 
He Phe Met Asp Cys Gly Phe His Ala Arg Glu Trp He Ser Pro Ala 
60 65 70 

TTC TGC CAG TGG TTT GTA AGA GAG GCT GTT CGT ACC TAT GGA CGT GAG 336 
Phe Cys Gin Trp Phe Val Arg Glu Ala Val Arg Thr Tyr Gly Arg Glu 
"75 80 85 90 

ATC CAA GTG ACA GAG CTT CTC 6AC AAG TTA GAC TTT TAT GTC CTG CCT 384 
He Gin Val Thr Glu Leu Leu Asp Lys Leu Asp Phe Tyr Val Leu Pro 
95 100 105 

GTG CTC AAT ATT GAT GGC TAC ATC TAC ACC TGG ACC AAG AGC CGA TTT 432 
Val Leu Asn He Asp Gly Tyr He Tyr Thr Trp Thr Lys Ser Arg Phe 
110 115 120 

TGG AGA AAG ACT CGC TCC ACC CAT ACT GGA TCT AGC TGC ATT GGC ACA 480 
Trp Arg Lys Thr Arg Ser Thr His Thr Gly Ser Ser Cys He Gly Thr 
125 130 135 

GAC CCC AAC AGA AAT TTT GAT GCT GGT TGG TGT GAA ATT GGA GCC TCT 528 
Asp Pro Asn Arg Asn Phe Asp Ala Gly Trp Cys Glu He Gly Ala Ser 
140 145 150 



CGA AAC CCC TGT GAT GAA ACT TAC TGT GGA CCT GCC GCA GAG TCT GAA 
Arg Asn Pro Cys Asp Glu Thr Tyr Cys Gly Pro Ala Ala Glu Ser Glu 
155 160 165 170 



576 



AAG GAG ACC AAG GCC CTG GCT GAT TTC ATC CGC AAC AAA CTC TCT TCC 
Lys Glu Thr Lys Ala Leu Ala Asp Phe He Arg Asn Lys Leu Ser Ser 
175 180 185 



624 
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ATC AAG GCA TAT CTG ACA ATC CAC TCG TAG TCC CAA ATG ATG ATC TAC 672 
He Lys Ala Tyr Leu Thr He His Ser Tyr Ser Gin Met Met He Tyr 
190 195 200 

CCT TAC TCA TAT GCT TAC AAA CTC GGT GAG AAC AAT GCT GAG TTG AAT 720 
Pro Tyr Ser Tyr Ala Tyr Lys Leu Gly Glu Asn Asn Ala Glu Leu Asn 
205 210 215 

GCC CTG GCT AAA GCT ACT GTG AAA GAA CTT GCC TCA CTG CAC GGC ACC 768 
Ala Leu Ala Lys Ala Thr Val Lys Glu Leu Ala Ser Leu His Gly Thr 
220 225 230 

AAG TAC ACA TAT GGC CCG GGA GCT ACA ACA ATC TAT CCT GCT GCT GGG 816 
Lys Tyr Thr Tyr Gly Pro Gly Ala Thr Thr He Tyr Pro Ala Ala Gly 
235 240 245 250 

GGC TCT CGC GAC TGG GCT TAT GAC CAA GGA ATC AGA TAT TCC TTC ACC 864 
Gly Ser Arg Asp Trp Ala Tyr Asp Gin Gly He Arg Tyr Ser Phe Thr 
255 260 265 

TTT GAA CTT CGA GAT ACA GGC AGA TAT GGC TTT CTC CTT CCA GAA TCC 912 
Phe Glu lieu Arg Asp Thr Gly Arg Tyr Gly Phe Leu Leu Pro Glu Ser 
270 275 280 



CAG ATC CGG GCT ACC TGC GAG GAG ACC TTC CTG GCA ATC AAG TAT GTT 960 
Gin He Arg Ala Thr Cys Glu Glu Thr Phe Leu Ala He Lys Tyr Val 
285 290 295 

GCC AGC TAC GTC CTG GAA CAC CTG TAC CAC CAC CAT CAC CAC CAT GAG 1008 
Ala Ser Tyr Val Leu Glu His Leu Tyr His His His His His His Glu 
300 305 310 

TTC GAG GAG CAG AAG CTG ATC TCT GAG GAG GAC CTG AAC TAATAAGAAT 1057 
Phe Glu Glu Gin Lys Leu He Ser Glu Glu Asp Leu Asn 
315 320 325 

TC 1059 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
-22 -20 -15 -10 



Ala Gin Pro Ala Met Ala Ala Thr Gly His Ser Tyr Glu Lys Tyr Asn 

-5 15 10 

Lys Trp Glu Thr He Glu Ala Trp Thr Gin Gin Val Ala Thr Glu Asn 

15 20 25 

Pro Ala Leu He Ser Arg Ser Val He Gly Thr Thr Phe Glu Gly Arg 

30 35 40 
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Ala He Tyr Leu Leu Lys Val Gly Lys Ala Gly Gin Asn Lys Pro Ala 
45 50 55 

He Phe Met Asp Cys Gly Phe His Ala Arg Glu Trp He Ser Pro Ala 
€0 65 70 

Phe Cys Gin Trp Phe Val Arg Glu Ala Val Arg Thr Tyr Gly Arg Glu 
75 80 85 90 

He Gin Val Thr Glu Leu Leu Asp Lys Leu Asp Phe Tyr Val Leu Pro 
95 100 105 

Val Leu Asn He Asp Gly Tyr He Tyr Thr Trp Thr Lys Ser Arg Phe 
110 115 120 

Trp Arg Lys Thr Arg Ser Thr His Thr Gly Ser Ser Cys He Gly Thr 
125 130 135 

Asp Pro Asn Arg Asn Phe Asp Ala Gly Trp Cys Glu He Gly Ala Ser 
140 145 150 

Arg Asn Pro Cys Asp Glu Thr Tyr Cys Gly Pro Ala Ala Glu Ser Glu 
155 160 165 170 

Lys Glu Thr Lys Ala Leu Ala Asp Phe He Arg Asn Lys Leu Ser Ser 

175 180 185 

He Lys Ala Tyr Leu Thr He His Ser Tyr Ser Gin Met Met He Tyr 
190 195 200 

Pro Tyr Ser Tyr Ala Tyr Lys Leu Gly Glu Asn Asn Ala Glu Leu Asn 
205 210 215 

Ala Leu Ala Lys Ala Thr Val Lys Glu Leu Ala Ser Leu His Gly Thr 
220 225 230 

Lys Tyr Thr Tyr Gly Pro Gly Ala Thr Thr He Tyr Pro Ala Ala Gly 
235 240 245 250 

Gly Ser Arg Asp Trp Ala Tyr Asp Gin Gly He Arg Tyr Ser Phe Thr 
255 260 265 

Phe Glu Leu Arg Asp Thr Gly Arg Tyr Gly Phe Leu Leu Pro Glu Ser 
270 275 280 

Gin He Arg Ala Thr Cys Glu Glu Thr Phe Leu Ala He Lys Tyr Val 
285 290 295 

Ala Ser Tyr Val Leu Glu His Leu Tyr His His His His His His Glu 
300 305 310 



Phe Glu Glu Gin Lys Leu He Ser Glu Glu Asp Leu Asn 
315 320 325 

(2) INFORMATION FOR SEQ ID NO: 65: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
GGATTTGGGG GAGGAACCTG GCTTCTGCTG 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

iC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Ala Pro Pro Val Ala Gly Pro Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Val Pro Glu Val Ser Ser Val Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 68: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ .ID NO: 68: 
GCAGCAGGAT AGATTGTTGT AGC 23 
(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
CAATCTATCC TGCTGCTGGG AACTCTCGCG 30 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
GGTCATAAGC CCAGTCTTTA GAGTTCCCAG 30 
(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
CTATCCTGCT GCTGGGAACT CTAAAGACTG 3 0 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
CAATCTATCC TGCTGCTGGG ACTTCTAAAG 30 
U) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
GGAATCAGAT ATTCCTTCGG CTTTGAAC 28 
(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

GGAATATCTG ATTCCTTC3GT C 21 



(2) INFORMATION FOR SEQ ID NO: 75: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CAATCTATCC TGCTGCTGGG TCTTCTAAAG 30 
(2) INFORMATION FOR SEQ ID NO: 76: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
GATTGTTGTA GCTCCCGGGC 
(2) INFORMATION FOR SEQ ID- NO: 77: 

( i ) . SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
GGAGCTACAA CAATCTATCC TAACGCTGGG 
(2) INFORMATION FOR SEQ ID NO: 78: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
GGAGCTACAA CAATCTATCC TTTCGCTGGG 
(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECrULE TYPE: other nucleic acid 



SUBSTITUTE SHEET (RULE 26) 



wo 97/07769 



-167- 



PCT/GB96/01975 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
GGGTAGATCA TCATTTGGGA GTACG 
(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
CCAAATGATG ATCTACCCTT ACAACTATGC 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

CAATCTATCC TGCTGCTGGG ACTTCTCGOG 30 

(2) INFORMATION FOR SEQ ID NO: 82: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
CATTTGGGAG TACGAGTGGA TTG 23 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
CGTACTCCCA AATGATGACT TACCC 
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1. A conjugate which is substantially non-immunogenic in humans 
comprising a targeting moiety capable of binding with a tumour 
associated antigen, the targeting moiety being linked to a mutated 
form of a carboxypeptidase B (CPB) enzyme capable of converting a 
prodrug into an antineoplastic drug wherein the prodrug is not 
significantly convertible into antineoplastic drug in humans by 
natural unmutated enzyme. 

2. A conjugate according to claim 1 in which the targeting 
moiety is an antibo^. 

3. A conjugate according to claim 2 in which the antibody is a 
F(ab')2 *"tibody fragment. 

4. A conjugate according to any one of claims 1-3 In which the 
enzyme is mutated to comprise a polarity change in its active site 
such that it can turn over a prodrug with a complementary polarity. 

5. A conjugate according to claim 4 in which the enzyme is any 
one of the following pancreatic human CPB mutants: 

pancreatic human CPB having amino acid Asp 253 substituted by any one 
of Arg, Asn, Gin or Lys optionally in combination with any one or more 
amino acid substitutions selected from: 

natural amino acid Gin 54 substituted by any one of Arg, Lys or Asn; 
natural amino acid Asp 145 substituted by any one of Val, Leu, lie or 
Ala; 

natural amino acid He 201 substituted by any one of Ser or Thr; 
natural amino acid Ser 205 substituted by any one of Asn, Gin, His, 
Lys or Arg; 

natural amino acid He 245 substituted by any one of Ser, Thr, Ala, 
Val, Leu, Asn, Gin, Lys, Arg or His; 

natural amino acid Ala 248 substituted by any one of Asn, Gin, Lys, 
Arg, His, Ser or Thr; 

natural amino acid Gly 251 substituted by any one of Thr, Asn, Ser, 
Gin, His, Lys, Arg, Val, He, Leu, Met, Phe, Ala or Norleuclne; and 
natural amino acid Cys 288 substituted by any one of Ser, Thr, Ala, 
Val, Leu or He. 
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6. A conjugate according to claim 4 in which the enzyme is any 
one of the following pancreatic human CPB mutants: 

pancreatic human CPB having natural amino acid Asp 253 substituted by 
any one of Arg or Lys and natural amino acid Gly 251 substituted by 
any one of Thr, Asn^ Ser, Gin, Lys or Val, optionally in combination 
with any one or more amino acid substitutions selected from: 
natural amino acid Gin 54 substituted by Arg; 
natural amino acid Asp 145 substituted by Ala; 
natural amino acid He 201 substituted by Ser; 
natural amino acid Ser 205 substituted by Asn; 

natural amino acid He 245 substituted by any one of Ser, Ala or His; 
natural amino acid Ala 248 substituted by any one of His, Ser or Asn; 
and 

natural amino acid Cys 288 substituted by any one of Ser or Ala* 

7. A conjugate according to claim 4 in which the enzyme is any 
one of the following pancreatic human CPB mutants: 

pancreatic human CPB having natural amino acid Asp 253 substituted by 
any one of Arg or Lys and natural amino acid Gly 251 substituted by 
any one of Thr, Asn or Ser optionally in combination with any one or 
more amino acid substitutions selected from: 
natural amino acid Gin 54 substituted by Arg; 
natural amino acid Asp 145 substituted by Ala; 
natural amino acid He 201 substituted by Ser; 
natural amino acid Ser 205 substituted by Asn; 
natural amino acid He 245 substituted by Ala; 

natural amino acid Ala 248 substituted by any one of Ser or Asn; and 
natural amino acid Cys 288 substituted by Ser. 

8. A conjugate according to claim 4 in which the enzyme is any 
one of the following pancreatic human CPB mutants: 

D253K; D253R; [G251N,D253K] ; [G251T,D253K] ; IG251S,D253KJ ; 
[G251T,D253R,1; [A248S,G251T,D253K1 ; IA248N,G251N,D253K] ; 
[A248S,G251N,D253K]; or [S205N,G251N,D253K1 . 

9. A matched two component system designed for use in a host in 
which the components comprise: 
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(1) a first component that Is a targeting moiety capable of 

binding with a tumour associated antigen, the targeting moiety being 
linked to a CPB enzyme capable of converting a prodrug into an 
antineoplastic drug and; 

(li) a second component that is a prodrug convertible under the 

influence of the enzyme to the antineoplastic drug; 

vherein: 

the enzyme is a mutated form of a CPB enzyme; 

the first component is substantially non-immunogenic in the host and; 
the prodrug is not significantly convertible into antineoplastic drug 
in the host by natural unmutated host enzyme. 

10 • A two component system according to claim 9 in which the 

first component is a conjugate as defined in any one of claims 1-8 and 
the system is designed for use in a human host. 

11. A two component system according to any one of claims 9 or 
10 in which the second component comprises any one of the prodrugs 
defined in claim 12b) or 12d). 

12. Any one of the following compounds or a pharmaceutically 
acceptable salt thereof: 

a) !!-(4-{4-Ibis-(2-chloroethyl)-aminoI.3-methyl-phenoxy)- 
benzoyl ) -L-alanine ; 

b) N-[N.(4-{4-[bis-(2-chloroethyl).aminol-3-methyl.phenoxy}- 
benzoyl) -L-alanine J -L-glutamic acid; 

c ) N- ( 4- { 4- [ bis - ( 2-chloroethyl ) -amino ] -phenoxy } -benzoyl ) -L- 
alanine; or 

d) N- [N- ( 4- { 4- [ bis- (2-chloroethyl ) -amino ] -phenoxy} -benzoyl ) -L- 
alanine J -L-glutamic acid . 

13. A mutant CPB enzyme as defined in any one of claims 1 or 
4-8. 

14. A polynucleotide sequence capable of encoding a mutant CPB 
enzyme defined in claim 13. 

15. A vector comprising a polynucleotide sequence defined in 
claim 14. 

16. A host cell comprising a polynucleotide sequence defined in 
claim 14. 
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17. A method of making a mutant CPB enzyme as defined in claim 
13 which comprises expressing the enzyme from a host cell as defined 
in claim 16 and optionally at least partially purifying the enzyme. 

18. A pharmaceutical composition comprising a first component as 
defined in any of claims 9-10 and a pharmaceutically acceptable 
carrier or diluent* 

19. A pharmaceutical composition comprising a second component 
as defined in any of claims 9 or 11 and a pharmaceutically acceptable 
carrier or diluent. 

20. A method of making a conjugate vhich comprises linking a 
targeting moiety capable of binding with a tumour associated antigen 
as defined in any one of claims 1-3 and an enzyme capable of 
converting a prodrug into an antineoplastic drug wherein the enzyme is 
a mutated form of a host CFB enzyme as defined in any one of claims 1 
or 4-8. 

21. Plasmid pCG330 as deposited with the National Collection of 
Industrial and Marine Bacteria (NCIHB) under accession number NCIHB 
40694. 

22. A nucleotide sequence encoding a mature human pancreatic 
carboxypeptidase B defined in SEQ ID NO: 39 from position 109 onwards 
or a mutant thereof in which there is a cysteine residue encoded at 
position 243. 

23. A method of making human pancreatic carboxypeptidase B or a 
mutant thereof in which there is a cysteine residue encoded at 
position 243 in which the method comprises expression in a host cell 
of a nucleotide sequence encoding a mature human pancreatic 
carboxypeptidase B defined in SEQ ID NO: 39 from position 109 onwards 
or a mutant thereof in which there is a cysteine residue encoded at 
position 243. 

24. A method of treating neoplastic cells in a host in which 
the method comprises administration to said host an effective amoimt 
of a first component, allowing the first component to clear 
substantially from the general circulation , and administering an 
effective amount of a second component wherein the components form a 
system as defined in any one of claims 9-11. 



SUBSTITUTE SHEET (RULE 26) 



wo 97/07769 



1/17 



PCT/GB96/01975 



Figure 1 



Human Pancreatic Carboxvpeptidase B 



Cloning 



Pancreas carboxvpeptidase B . 

Sacr Xbal Xbol 

BPT1 BPT2 
► - ► 

Pre Pro Mature End 



cDNA ■ 

0 45 330 1250 

BPB1 
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Figure 2 



Human Pancreatic Carbo xvpe ptldase B 



Sequencing 



1279 ^ 1280 ^ 1281 



336 ^ 676 677 



Pro Mature End 

il I # 

pBluescript ^ 285 1206 

4 ^ 

® 0 



All 6 clones have identical sequence, and all have 

0 Aspartate in the enzyme recognition site, ie Caiboxypeptidase B. 

When compared with published sequence > 

® TGC codon insert, changing polypeptide ...GSSIG,.. to ...GSSCIG... . 



SUBSTITUTE SHEET (RULE 26) 



wo 97/07769 



3/17 



PCT/GB96/01975 




SUBSTITUTE SHEET (RULE 26) 



wo 97/07769 



4/17 



PCT/GB96/01975 



Figure 4 



PICI266 expression vector ■ gene cloning 



TCACACTTTCCAAACCATTAGCATTTTTCTCCATAACATAACCCCATCCTCCCTGACCCTTTTTGCCCCCACTCTCTACTCTTTCTCCAT 



1170 



PtB — ^ ^1 

ACCTGTTTTTCTCCATGCACTAACACCkTCAAATACCTATTGCCTACCGCACCCCCTCCATTGTTATTACTCCCTCCCCAACCACCCATC 



1260 



Insert gene. 



net Lys Tyr Leu Lmj Pro Thr Ala Alo Alo Gly Lmj Uu Uu Lbu AIq Ala Gin Pro Akj Het 



1^ 



I Bs^lt 



)9^o\ AsuU £ooRI 



1 I i i ! 

CCI±TACCAATAGCAGATCTAAT6TCTCTAGATGTTACCTCCACTTCCAAGAATTCCTACACTCGACATTATATTACTAATTAATTCCCC 



1350 



Alo 



Vol Pro lie Ala AspLMjHetSerUuAsp Vol ThrSerSerSerLyaAsnSer 



ACCCTAGAGGTCCCCTTTTTTATTTTAAAAAGCATGCGGATCCCTCGGAAATACAGGAACGCACGCTGGATGGCCCTTCGCTGGGATGGT 



1440 
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Figure 6 Growth medium 



Component 


Concentration g/l 
deionised water 


Potassium di-hydrogen orthophosphate 


3.0 


di-sodium hydrogen orthophosphate 


6,0 


Sodium chloride 


0.5 


Casein hydrolysate 


2.0 


Ammonium sulphate 


10.0 


Glycerol 


35.0 


Yeast extract 


20.0 


Magnesium sulphate heptahydrate 


0.5 


Calcium chloride di-hydrate 


0.03 


Thiamine 


0.008 


Iron sulphate heptahydrate 


0.04 


Citric acid 


0.02 


Trace element solution (TES)* 


0.5 ml/1 


Tetracycline hydrochloride 


0.01 


* Trace element solution (TES) 


Component 


mg per 10ml de-ionised 
water 


Aluminium chloride hexahydrate 


2.0 


Cobalt chloride hexahydrate 


0.8 


Potassium chromium sulphate 
dodecahydrate 


0.2 


Copper chloride dihydrate 


0.2 


Boric acid 


0.1 


Potassium iodide 


2.0 


Manganese sulphate monohydrate 


2.0 


Nickel sulphate hexahydrate 


0.09 


Sodium molybdate dihydrate 


0.4 


Zinc sulphate heptahydrate 


0.4 



SUBSTITUTE SHEET (RULE 26) 



wo 97/07769 



7/17 



PCT/GB96/01975 



Figure 7 

O COOCHjPh 



COOCHjPh g JL 



X 

1 2a - c 3 a - c 

CI CI 
b < r^r. ?i ^^^^^ ^ ^ /r\ O COOH 

> O X > O X 

CI CI 

^^■^ 5a-c 

In 2-4: 

a XsCOOCH2Ph 

b x = CH2COOCH2Ph in 5a, X = COOH 

in 5b. X= CHpCOOH 

C X=(CH2)2NH N_No^ 2 ^ 

H 

a ; CHCI3 . Et3N b) 6. CHOa , EDCI. DMAP c) Pd/C H2, ACOEt , MeOH 

c 

6 : N-^'^OH . 1HCI 

5 

ci^ 
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Figure 8 
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Figure 9 
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Figure 10 
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b 



I 



c 

CHjPh 
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Figure 11 
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Figure 12 

R ? /(Pa) 

B0CNI4-L-C0,H + Y-< ► BOCNH— L-CO- Y-< 
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Figure 14 
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